repalash / CUDAClionStarterProjectLinks
CUDA CLion starter project template, with simple vector addition code.
☆26Updated 7 years ago
Alternatives and similar repositories for CUDAClionStarterProject
Users that are interested in CUDAClionStarterProject are comparing it to the libraries listed below
Sorting:
- ☆71Updated 4 months ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆31Updated 5 years ago
- BGHT: High-performance static GPU hash tables.☆71Updated 7 months ago
- Source code examples from the Parallel Forall Blog☆96Updated 6 years ago
- CUDA Data Parallel Primitives Library☆438Updated 7 years ago
- A warp-oriented dynamic hash table for GPUs☆76Updated 2 years ago
- CUDA implementation of parallel radix sort using Blelloch scan☆67Updated last year
- A Library for fast Hash Tables on GPUs☆132Updated 3 months ago
- Parallel network flows using OpenMP and CUDA.☆28Updated 7 years ago
- CUDA implementation of exclusive prefix sum via Blelloch's algorithm☆29Updated 8 years ago
- Efficient CUDA Stream Compaction Library☆35Updated 2 years ago
- Full-speed Array of Structures access☆176Updated 2 years ago
- iBFS: Concurrent Breadth-First Search on GPUs. SIGMOD'16☆26Updated 8 years ago
- Implementation of breadth first search on GPU with CUDA Driver API.☆54Updated 4 years ago
- SuiteSparse: a suite of sparse matrix packages by @DrTimothyAldenDavis et al. with native CMake support☆53Updated last month
- ☆59Updated 4 months ago
- CUSP : A C++ Templated Sparse Matrix Library☆420Updated 6 months ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.☆73Updated 10 years ago
- A gpu based implementation of a K-D Tree Builder☆118Updated 6 years ago
- Asynchronous Multi-GPU Programming Framework☆48Updated 4 years ago
- Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.☆348Updated 3 years ago
- CUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.☆59Updated 3 years ago
- Parallel cuckoo hashing on GPUs with CUDA☆12Updated 6 years ago
- Programmable CUDA/C++ GPU Graph Analytics☆1,065Updated last week
- Lock-free parallel disjoint set data structure (aka UNION-FIND) with path compression and union by rank☆67Updated 10 years ago
- High-Performance Linear Algebra-based Graph Primitives on GPUs☆234Updated 4 years ago
- GPU B-Tree with support for versioning (snapshots).☆51Updated last year
- A matrix and array operation library on GPU with Eigen compatible interface☆99Updated 8 years ago
- TaichiCon: Taichi Conferences☆72Updated 3 years ago
- Implementation of the maximum network flow problem in CUDA.☆31Updated 5 years ago