Apple's MLX adding CUDA support
10 months ago
- #CUDA
- #Performance Optimization
- #MLX
- Notifications require signing in to change settings.
- Discussion on adding ROCm support based on CUDA pull request.
- Options for incorporating CUDA backend into MLX, favoring frequent merging.
- Considerations for ROCm and CUDA backend coexistence.
- Ongoing refactoring and experimentation with CUDA backend code.
- Performance analysis and optimizations for CUDA kernels.
- Challenges with kernel launching overhead and potential optimizations.
- Memory management strategies for operands and temporaries in CUDA.
- Collaboration offers for testing on Jetson devices.
- Build instructions and development setup for CUDA backend.
- Reasons for developing a CUDA backend: performance and compatibility.