This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or looking to optimize and scale your GPU-accelerated applications.
☆453Feb 22, 2025Updated last year
Alternatives and similar repositories for cuda-learning
Users that are interested in cuda-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆487Mar 10, 2025Updated last year
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…☆924Mar 29, 2025Updated last year
- GPU Kernels☆225Apr 27, 2025Updated last year
- 100 days of building GPU kernels!☆602Apr 27, 2025Updated last year
- ☆430Apr 10, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This repository documents my 100-day journey of learning and writing CUDA kernels.☆31Mar 29, 2026Updated 2 months ago
- ☆46May 24, 2025Updated last year
- ☆3,705Mar 11, 2026Updated 3 months ago
- Building GPT ...☆18Dec 1, 2024Updated last year
- Apply GPU in ML and DL☆68Mar 23, 2026Updated 2 months ago
- ☆11Aug 4, 2025Updated 10 months ago
- Package for data-driven and phenomenological gravitational waveform models☆12Updated this week
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆146May 11, 2026Updated last month
- A Wadler–Lindig pretty printer for Python☆47Apr 20, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Puzzles for learning Triton☆2,485Apr 1, 2026Updated 2 months ago
- Mini CCL - A lightweight collective communication library☆32Jan 2, 2026Updated 5 months ago
- Learnings and programs related to CUDA☆437Jun 29, 2025Updated 11 months ago
- Astronomy Research + JAX Meeting 2024☆13Oct 23, 2024Updated last year
- GPU Programming with C++ and CUDA, published by Packt☆93Dec 15, 2025Updated 5 months ago
- Write a fast kernel and see how you compare against the best humans and AI on gpumode.com☆99Updated this week
- Code for the study "Gravitational wave inference on a numerical-relativity simulation of a black hole merger beyond general relativity"☆11Jan 10, 2023Updated 3 years ago
- ☆14Mar 29, 2026Updated 2 months ago
- GWInferno: Gravitational-Wave Hierarchical Inference with NumPyro☆21Feb 27, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- JAX-accelerated nuclear equation of state inference and TOV solvers☆15May 25, 2026Updated 2 weeks ago
- Performing parameter estimation on gravitational wave data with machine learning☆21May 4, 2026Updated last month
- Class of High Performance Computing taken at U.T.P 2017☆139Oct 11, 2017Updated 8 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 4 months ago
- ☆254Jan 2, 2025Updated last year
- Material for gpu-mode lectures☆6,156May 9, 2026Updated last month
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,216Aug 26, 2025Updated 9 months ago
- Personal solutions to the Triton Puzzles☆21Jul 18, 2024Updated last year
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Jun 5, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Make triton easier☆50Jun 12, 2024Updated 2 years ago
- GPU programming related news and material links☆2,162Mar 8, 2026Updated 3 months ago
- Port of Karpathy's micrograd in pure C. Micrograd is a tiny scalar-valued autograd engine and a neural net library on top of it with PyTo…☆35Jul 27, 2024Updated last year
- ☆493Dec 18, 2025Updated 5 months ago
- Neural Emulator Architectures in JAX.☆26Jun 1, 2026Updated last week
- Notes of PRNN course taught at IISC as part of MTech AI curriculum☆19Nov 30, 2024Updated last year
- A generic, composable multi-dimensional array library.☆12May 23, 2026Updated 3 weeks ago