NVIDIA / accelerated-computing-hubLinks
NVIDIA curated collection of educational resources related to general purpose GPU programming.
☆565Updated this week
Alternatives and similar repositories for accelerated-computing-hub
Users that are interested in accelerated-computing-hub are comparing it to the libraries listed below
Sorting:
- NVIDIA Math Libraries for the Python Ecosystem☆333Updated last week
- Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)☆813Updated 10 months ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆754Updated 4 months ago
- NVIDIA tools guide☆138Updated 6 months ago
- Kernel Tuner☆353Updated this week
- CUDA Kernel Benchmarking Library☆682Updated this week
- The CUDA target for Numba☆149Updated last week
- Fast CUDA matrix multiplication from scratch☆764Updated last year
- CUDA Learning guide☆403Updated last year
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆280Updated last month
- Step-by-step optimization of CUDA SGEMM☆355Updated 3 years ago
- Examples from Programming in Parallel with CUDA☆157Updated 2 years ago
- A plugin for Jupyter Notebook to run CUDA C/C++ code☆236Updated 10 months ago
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS☆196Updated 2 months ago
- GPU programming related news and material links☆1,616Updated 6 months ago
- Fastest kernels written from scratch☆290Updated 3 months ago
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆415Updated this week
- ☆168Updated 11 months ago
- Training material for Nsight developer tools☆161Updated 11 months ago
- ☆110Updated 4 months ago
- CUDA Matrix Multiplication Optimization☆202Updated 11 months ago
- collection of benchmarks to measure basic GPU capabilities☆391Updated 5 months ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆343Updated this week
- ☆554Updated this week
- Evaluating Large Language Models for CUDA Code Generation ComputeEval is a framework designed to generate and evaluate CUDA code from Lar…☆53Updated 3 weeks ago
- RAPIDS Memory Manager☆595Updated this week
- Experimental projects related to TensorRT☆107Updated this week
- CUDA Core Compute Libraries☆1,761Updated this week
- A tool for bandwidth measurements on NVIDIA GPUs.☆482Updated 3 months ago
- Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch☆845Updated last year