Rust Threads on the GPU

5 days ago

VectorWare announces successful use of Rust's std::thread on the GPU, enabling developers to write high-performance GPU applications with familiar Rust abstractions.
GPUs execute programs via kernels launched in parallel, differing from CPUs where threads are spawned explicitly; this creates challenges for Rust's safety model.
Rust kernels on GPUs traditionally use unsafe code and raw pointers due to the mismatch between Rust's ownership model and GPU execution, treating kernels like FFI boundaries.
VectorWare's approach maps each std::thread to a GPU warp, starting with only one active warp (like CPU's main thread) and waking others via thread::spawn to mimic CPU concurrency.
This warp-as-thread model prevents divergence, allows Rust's borrow checker to work naturally, and integrates with existing Rust ecosystem libraries like rayon and tokio.
Benefits include abstracting GPU details (warps, blocks), enabling safe concurrency, and supporting both threads and async/await on GPUs for broader Rust code compatibility.
Downsides include finite warp resources, higher synchronization costs, potential underutilization of GPU lanes, and memory constraints for thread stacks.
VectorWare aims to make GPUs behave like normal Rust platforms, focusing on GPU-native applications while supporting multiple languages in future products.

Hasty Briefsbeta