zchee / cuda-sampleLinks
CUDA official sample codes
☆370Updated 9 years ago
Alternatives and similar repositories for cuda-sample
Users that are interested in cuda-sample are comparing it to the libraries listed below
Sorting:
- Source code that accompanies The CUDA Handbook.☆527Updated 5 months ago
- CUDA by practice☆129Updated 5 years ago
- CUDA Data Parallel Primitives Library☆432Updated 6 years ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆748Updated 4 months ago
- ☆552Updated this week
- ☆447Updated 10 years ago
- ☆67Updated 11 years ago
- Source code examples from the Parallel Forall Blog☆1,297Updated 11 months ago
- This is a list of useful libraries and resources for CUDA development.☆571Updated 7 years ago
- Training material for Nsight developer tools☆160Updated 11 months ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆84Updated last year
- CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. …☆427Updated 2 years ago
- CUDA Kernel Benchmarking Library☆679Updated last week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆546Updated 3 weeks ago
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆413Updated last week
- Example of how to use CUDA with CMake >= 3.8☆70Updated last month
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆106Updated 7 years ago
- Efficient Top-K implementation on the GPU☆181Updated 6 years ago
- BLISlab: A Sandbox for Optimizing GEMM☆531Updated 4 years ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆132Updated 5 years ago
- matrix multiplication in CUDA☆123Updated last year
- Thin, unified, C++-flavored wrappers for the CUDA APIs☆846Updated 3 weeks ago
- Demonstration of various hardware effects on CUDA GPUs.☆383Updated last year
- CUSP : A C++ Templated Sparse Matrix Library☆413Updated last month
- CUDA Matrix Multiplication Optimization☆201Updated 11 months ago
- Kernel Tuner☆351Updated this week
- Full-speed Array of Structures access☆171Updated 2 years ago
- Online CUDA Occupancy Calculator☆78Updated 3 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆155Updated 2 years ago
- ☆60Updated 2 years ago