nosferalatu / SimpleGPUHashTable
A simple GPU hash table implemented in CUDA using lock-free techniques
☆382 · Updated 9 months ago
Related projects
Alternatives and complementary repositories for SimpleGPUHashTable
- Demonstration of various hardware effects on CUDA GPUs. ☆358 · Updated 11 months ago
- A Library for fast Hash Tables on GPUs ☆109 · Updated 2 years ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC). ☆518 · Updated 5 months ago
- ☆486 · Updated this week
- CUDA Data Parallel Primitives Library ☆421 · Updated 6 years ago
- A warp-oriented dynamic hash table for GPUs ☆71 · Updated 10 months ago
- stdgpu: Efficient STL-like Data Structures on the GPU ☆1,162 · Updated this week
- This is a list of useful libraries and resources for CUDA development. ☆527 · Updated 7 years ago
- CUDA implementation of parallel radix sort using Blelloch scan ☆61 · Updated 8 months ago
- Thin, unified, C++-flavored wrappers for the CUDA APIs ☆797 · Updated this week
- Patterns and behaviors for GPU computing ☆1,667 · Updated 2 years ago
- GPU-Accelerated Lossless Data Compressors Survey ☆110 · Updated 4 years ago
- BGHT: High-performance static GPU hash tables. ☆55 · Updated 2 months ago
- ☆132 · Updated last year
- CUDA Kernel Benchmarking Library ☆519 · Updated this week
- Enoki: structured vectorization and differentiation on modern processor architectures ☆1,261 · Updated 7 months ago
- A GPU-based implementation of a K-D Tree Builder ☆96 · Updated 5 years ago
- Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade… ☆560 · Updated 2 months ago
- A fast and highly scalable GPU dynamic memory allocator ☆103 · Updated 9 years ago
- A CUDA implementation of a priority queue ☆81 · Updated 4 years ago
- RAPIDS Memory Manager ☆492 · Updated this week
- Efficient Top-K implementation on the GPU ☆149 · Updated 5 years ago
- ☆201 · Updated last month
- Kernel Tuner ☆287 · Updated last week
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl ☆1,684 · Updated last year
- Lossless compressor of multidimensional floating-point arrays ☆106 · Updated 4 years ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster ☆558 · Updated 3 weeks ago
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT ☆667 · Updated last week
- Agenium Scale vectorization library for CPUs and GPUs ☆328 · Updated 3 years ago
- CUDA Core Compute Libraries ☆1,278 · Updated this week