mark-poscablo / gpu-prefix-sum
CUDA implementation of exclusive prefix sum via Blelloch's algorithm
☆25 Updated 7 years ago
Related projects
Alternatives and complementary repositories for gpu-prefix-sum
- Fast CUDA 3x3 SVD ☆69 Updated 6 years ago
- BGHT: High-performance static GPU hash tables. ☆55 Updated last month
- CUDA implementation of parallel radix sort using Blelloch scan ☆61 Updated 8 months ago
- EGGS, a method to speed up sparse matrix operations when the same sparsity pattern is used multiple times. This repo contains examples that s… ☆25 Updated 4 years ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels. ☆70 Updated 9 years ago
- Code for SIGGRAPH 2022 paper "Automatic quantization for physics-based simulation" ☆61 Updated 2 years ago
- A symbolic asymptotic numerical solver ☆30 Updated 3 years ago
- RTX compute samples ☆68 Updated last year
- A GPU-based implementation of a k-d tree builder ☆96 Updated 5 years ago
- Efficient CUDA Stream Compaction Library ☆33 Updated last year
- Some CUDA design patterns and a bit of template magic for CUDA