zchee / cuda-sample
CUDA official sample codes
☆365Updated 9 years ago
Alternatives and similar repositories for cuda-sample:
Users that are interested in cuda-sample are comparing it to the libraries listed below
- Source code that accompanies The CUDA Handbook.☆521Updated last month
- ☆427Updated 9 years ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆663Updated last month
- CUDA Kernel Benchmarking Library☆593Updated last week
- Source code examples from the Parallel Forall Blog☆1,270Updated 8 months ago
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆359Updated last week
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆130Updated 4 years ago
- CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. …☆401Updated last year
- Training material for Nsight developer tools☆151Updated 7 months ago
- CUDA Data Parallel Primitives Library☆428Updated 6 years ago
- CUDA by practice☆125Updated 5 years ago
- ☆524Updated last week
- Thin, unified, C++-flavored wrappers for the CUDA APIs☆825Updated this week
- Automatically exported from code.google.com/p/opencl-book-samples☆166Updated 5 years ago
- Online CUDA Occupancy Calculator☆74Updated 3 years ago
- A CUDNN minimal deep learning training code sample using LeNet.☆264Updated last year
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆105Updated 7 years ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆247Updated this week
- Full-speed Array of Structures access☆164Updated last year
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,735Updated last year
- ☆58Updated 2 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆89Updated last year
- Examples for HIP☆203Updated 3 months ago
- Efficient Top-K implementation on the GPU☆168Updated 5 years ago
- Matrix Multiply-Accumulate with CUDA and WMMA( Tensor Core)☆126Updated 4 years ago
- Example of how to use CUDA with CMake >= 3.8☆69Updated last year
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆297Updated 6 years ago
- CUSP : A C++ Templated Sparse Matrix Library☆411Updated 4 months ago
- Demonstration of various hardware effects on CUDA GPUs.☆365Updated last year
- cuDNN sample codes provided by Nvidia☆45Updated 6 years ago