Ahdhn / CUDATemplate
Template for starting a CUDA/C++ project using CMake, with GitHub Actions for CI
☆29 · Updated last year
Related projects
Alternatives and complementary repositories for CUDATemplate
- Generate simple index ranges in C++ and CUDA C++ ☆39 · Updated last year
- Multi-GPU Framework for Voxel Grid Computations ☆42 · Updated this week
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. ☆42 · Updated 10 months ago
- Local and distributed octrees based on Morton codes with halo discovery and exchange with a 3D collision detection algorithm ☆35 · Updated last month
- A C++/CUDA library for loading CUDA sparse textures on demand in OptiX renderers ☆14 · Updated 2 weeks ago
- zenus parallel computing library for zenus physics-based simulations ☆79 · Updated this week
- RTX compute samples ☆68 · Updated last year
- μ-Cuda, COVER THE LAST MILE OF CUDA. With features: intellisense-friendly, structured launch, automatic cuda graph generation and updatin… ☆149 · Updated this week
- BGHT: High-performance static GPU hash tables. ☆55 · Updated last month
- C++ library for fast computation of neighbor lists in point clouds. ☆54 · Updated last year
- GPU-Accelerated multigrid solver for Poisson's equation in 2D ☆20 · Updated 3 years ago
- Thrust, CUB, TBB, AVX2, CUDA, OpenCL, OpenMP, SYCL - all it takes to sum a lot of numbers fast! ☆73 · Updated 6 months ago
- CUDA kernel author's tools ☆107 · Updated 2 years ago
- ☆22 · Updated 2 years ago
- CUDA implementation of exclusive prefix sum via Blelloch's algorithm ☆25 · Updated 7 years ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels. ☆70 · Updated 9 years ago
- Examples that demonstrate uses of the OptiX Toolkit ☆10 · Updated 5 months ago
- ☆56 · Updated 2 months ago
- Ray tracing implementations. Started with Peter Shirley's v2 Ray Tracing In One Weekend ☆49 · Updated 4 years ago
- ☆56 · Updated last year
- An implementation of parallel exclusive scan in CUDA ☆59 · Updated 6 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming" ☆81 · Updated last year
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings. ☆20 · Updated this week
- Some CUDA design patterns and a bit of template magic for CUDA ☆146 · Updated last year
- A nanobind example project ☆90 · Updated this week
- cuASR: CUDA Algebra for Semirings ☆34 · Updated 2 years ago
- 'cubicle' - a CUDA-centric BVH query library (kNN, find closest point, etc) ☆17 · Updated last week
- High-performance Geometric Multigrid ☆32 · Updated 5 years ago
- Code for SIGGRAPH 2022 paper "Automatic quantization for physics-based simulation" ☆61 · Updated 2 years ago
- 🎃 GPU load-balancing library for regular and irregular computations. ☆57 · Updated 4 months ago