repalash / CUDAClionStarterProjectLinks
CUDA CLion starter project template, with simple vector addition code.
☆26Updated 7 years ago
Alternatives and similar repositories for CUDAClionStarterProject
Users that are interested in CUDAClionStarterProject are comparing it to the libraries listed below
Sorting:
- SuiteSparse: a suite of sparse matrix packages by @DrTimothyAldenDavis et al. with native CMake support☆53Updated last month
- ☆71Updated 4 months ago
- A Library for fast Hash Tables on GPUs☆132Updated 3 months ago
- CUDA Data Parallel Primitives Library☆438Updated 7 years ago
- CUSP : A C++ Templated Sparse Matrix Library☆420Updated 6 months ago
- Source code examples from the Parallel Forall Blog☆96Updated 6 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆31Updated 5 years ago
- CUDA implementation of exclusive prefix sum via Blelloch's algorithm☆29Updated 8 years ago
- BGHT: High-performance static GPU hash tables.☆71Updated 7 months ago
- A warp-oriented dynamic hash table for GPUs☆76Updated 2 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆114Updated last year
- TaichiCon: Taichi Conferences☆72Updated 3 years ago
- This example builds on the parallel-forall repo separate compilation example by adding CMake to it.☆17Updated 8 years ago
- Full-speed Array of Structures access☆176Updated 2 years ago
- a software library containing Sparse functions written in OpenCL☆176Updated 5 years ago
- Frog is Asynchronous Graph Processing on GPU with Hybrid Coloring Model. The fundamental idea is based on Pareto principle (or 80-20 rule…☆36Updated 4 years ago
- GPUOCelot: A dynamic compilation framework for PTX☆290Updated 2 years ago
- Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.☆348Updated 3 years ago
- ☆22Updated 8 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆158Updated 2 years ago
- Parallel Algorithm Scheduling Library☆105Updated 8 years ago
- Stencil Probe - a stencil microbenchmark☆30Updated 13 years ago
- MWE for using the Eigen library in CUDA kernels☆121Updated 3 years ago
- A Distributed Multi-GPU System for Fast Graph Processing☆65Updated 7 years ago
- ☆62Updated 3 years ago
- A gpu based implementation of a K-D Tree Builder☆118Updated 6 years ago
- μ-Cuda, COVER THE LAST MILE OF CUDA. With features: intellisense-friendly, structured launch, automatic cuda graph generation and updatin…☆205Updated last month
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.☆73Updated 10 years ago
- A fast and highly scalable GPU dynamic memory allocator☆112Updated 10 years ago
- iBFS: Concurrent Breadth-First Search on GPUs. SIGMOD'16☆26Updated 8 years ago