NVIDIA / accelerated-computing-hubLinks
NVIDIA curated collection of educational resources related to general purpose GPU programming.
☆779Updated this week
Alternatives and similar repositories for accelerated-computing-hub
Users that are interested in accelerated-computing-hub are comparing it to the libraries listed below
Sorting:
- NVIDIA Math Libraries for the Python Ecosystem☆522Updated last month
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆819Updated last month
- Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)☆877Updated last year
- Fast CUDA matrix multiplication from scratch☆908Updated last month
- NVIDIA tools guide☆144Updated 9 months ago
- Kernel Tuner☆370Updated this week
- CUDA Learning guide☆461Updated last year
- Step-by-step optimization of CUDA SGEMM☆388Updated 3 years ago
- The CUDA target for Numba☆206Updated this week
- GPU programming related news and material links☆1,746Updated last month
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆310Updated this week
- ☆193Updated last year
- CUDA Kernel Benchmarking Library☆753Updated this week
- A plugin for Jupyter Notebook to run CUDA C/C++ code☆247Updated last year
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS☆233Updated 5 months ago
- Fastest kernels written from scratch☆377Updated last month
- Training material for Nsight developer tools☆170Updated last year
- ☆153Updated last week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆460Updated this week
- LeetGPU Challenges☆299Updated last week
- CUDA Matrix Multiplication Optimization☆230Updated last year
- Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch☆880Updated 2 years ago
- CUDA Core Compute Libraries☆1,987Updated this week
- ☆121Updated 7 months ago
- Examples from Programming in Parallel with CUDA☆161Updated 2 years ago
- Some CUDA example code with READMEs.☆176Updated 7 months ago
- Evaluating Large Language Models for CUDA Code Generation ComputeEval is a framework designed to generate and evaluate CUDA code from Lar…☆69Updated 3 weeks ago
- An ML Systems Onboarding list☆917Updated 9 months ago
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.☆491Updated this week
- ☆591Updated this week