CUDA Learning guide
☆540 · Jun 20, 2024 · Updated last year
Alternatives and similar repositories for Parallel-Computing-Cuda-C
Users who are interested in Parallel-Computing-Cuda-C are comparing it to the repositories listed below
- NVIDIA tools guide ☆164 · Jan 7, 2025 · Updated last year
- Read custom dataset ☆12 · Mar 31, 2023 · Updated 2 years ago
- Setup Cuda ☆27 · May 23, 2024 · Updated last year
- GPU programming related news and material links ☆2,047 · Mar 8, 2026 · Updated last week
- Personal notes on CUDA programming ☆54 · Mar 5, 2023 · Updated 3 years ago
- My study notes and hands-on projects for CUDA-based GPU programming ☆10 · Dec 11, 2025 · Updated 3 months ago
- Examples from Programming in Parallel with CUDA ☆170 · Feb 5, 2026 · Updated last month
- Learn CUDA Programming, published by Packt ☆1,237 · Dec 30, 2023 · Updated 2 years ago
- Solve puzzles. Learn CUDA. ☆11,997 · Sep 1, 2024 · Updated last year
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming" ☆97 · Aug 14, 2023 · Updated 2 years ago
- ☆3,383 · Mar 11, 2026 · Updated last week
- Solve puzzles. Learn CUDA. ☆62 · Dec 13, 2023 · Updated 2 years ago
- GPU Kernels ☆222 · Apr 27, 2025 · Updated 10 months ago
- Fast low-bit matmul kernels in Triton ☆438 · Feb 1, 2026 · Updated last month
- ☆15 · Feb 13, 2018 · Updated 8 years ago
- Implementation from scratch in CUDA C++ of image processing algorithms. ☆22 · Oct 26, 2020 · Updated 5 years ago
- Learn CUDA with PyTorch ☆253 · Mar 14, 2026 · Updated last week
- ☆93 · Nov 11, 2025 · Updated 4 months ago
- Standalone commandline CLI tool for compiling Triton kernels ☆20 · Sep 13, 2024 · Updated last year
- Step by step implementation of a fast softmax kernel in CUDA ☆62 · Jan 6, 2025 · Updated last year
- ☆14 · Apr 10, 2023 · Updated 2 years ago
- A plugin for Jupyter Notebook to run CUDA C/C++ code ☆259 · Sep 13, 2024 · Updated last year
- Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/) ☆949 · Aug 19, 2024 · Updated last year
- UNet diffusion model in pure CUDA ☆656 · Jun 28, 2024 · Updated last year
- Material for gpu-mode lectures ☆5,841 · Feb 1, 2026 · Updated last month
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression. ☆13 · Sep 29, 2024 · Updated last year
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc… ☆876 · Mar 29, 2025 · Updated 11 months ago
- ☆462 · Dec 18, 2025 · Updated 3 months ago
- High Quality Resources on GPU Programming/Architecture ☆592 · Jul 26, 2024 · Updated last year
- Fast CUDA matrix multiplication from scratch ☆1,100 · Sep 2, 2025 · Updated 6 months ago
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS ☆253 · May 6, 2025 · Updated 10 months ago
- CUDA Templates and Python DSLs for High-Performance Linear Algebra ☆9,442 · Updated this week
- CUDA Library Samples ☆2,346 · Updated this week
- CUDA Core Compute Libraries ☆2,217 · Updated this week
- Step-by-step optimization of CUDA SGEMM ☆445 · Mar 30, 2022 · Updated 3 years ago
- ☆91 · Feb 29, 2024 · Updated 2 years ago
- Apply GPU in ML and DL ☆62 · Updated this week
- Samples for CUDA Developers which demonstrates features in CUDA Toolkit ☆8,953 · Jan 6, 2026 · Updated 2 months ago
- ☆19 · May 17, 2016 · Updated 9 years ago