This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or looking to optimize and scale your GPU-accelerated applications.
☆440Feb 22, 2025Updated last year
Alternatives and similar repositories for cuda-learning
Users that are interested in cuda-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆466Mar 10, 2025Updated last year
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…☆876Mar 29, 2025Updated 11 months ago
- GPU Kernels☆222Apr 27, 2025Updated 10 months ago
- 100 days of building GPU kernels!☆579Apr 27, 2025Updated 10 months ago
- ☆418Apr 10, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆46May 24, 2025Updated 10 months ago
- This repository documents my 100-day journey of learning and writing CUDA kernels.☆27Jun 25, 2025Updated 9 months ago
- ☆3,401Mar 11, 2026Updated 2 weeks ago
- Building GPT ...☆18Dec 1, 2024Updated last year
- Apply GPU in ML and DL☆62Mar 16, 2026Updated last week
- ☆15Feb 23, 2025Updated last year
- Package for data-driven and phenomenological gravitational waveform models☆12Feb 12, 2025Updated last year
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆142Dec 30, 2025Updated 2 months ago
- A Wadler–Lindig pretty printer for Python☆46Jan 16, 2026Updated 2 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Puzzles for learning Triton☆2,348Mar 18, 2026Updated last week
- GPU Programming with C++ and CUDA, published by Packt☆75Dec 15, 2025Updated 3 months ago
- Mini CCL - A lightweight collective communication library☆30Jan 2, 2026Updated 2 months ago
- Astronomy Research + JAX Meeting 2024☆13Oct 23, 2024Updated last year
- Learnings and programs related to CUDA☆435Jun 29, 2025Updated 8 months ago
- Write a fast kernel and see how you compare against the best humans and AI on gpumode.com☆88Mar 19, 2026Updated last week
- Code for the study "Gravitational wave inference on a numerical-relativity simulation of a black hole merger beyond general relativity"☆11Jan 10, 2023Updated 3 years ago
- Poster at ITSC 2024☆20Nov 12, 2024Updated last year
- ☆14May 18, 2025Updated 10 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Personal solutions to the Triton Puzzles☆20Jul 18, 2024Updated last year
- GWInferno: Gravitational-Wave Hierarchical Inference with NumPyro☆21Feb 27, 2026Updated 3 weeks ago
- JAX-accelerated nuclear equation of state inference and TOV solvers☆10Mar 13, 2026Updated last week
- Class of High Performance Computing taken at U.T.P 2017☆120Oct 11, 2017Updated 8 years ago
- learningggggggg 🐳☆615Apr 2, 2025Updated 11 months ago
- Performing parameter estimation on gravitational wave data with machine learning☆21Feb 9, 2026Updated last month
- ☆240Jan 2, 2025Updated last year
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆18Feb 9, 2026Updated last month
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,119Aug 26, 2025Updated 6 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Material for gpu-mode lectures☆5,865Feb 1, 2026Updated last month
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- Port of Karpathy's micrograd in pure C. Micrograd is a tiny scalar-valued autograd engine and a neural net library on top of it with PyTo…☆34Jul 27, 2024Updated last year
- Make triton easier☆50Jun 12, 2024Updated last year
- GPU programming related news and material links☆2,060Mar 8, 2026Updated 2 weeks ago
- ☆463Dec 18, 2025Updated 3 months ago
- Neural Emulator Architectures in JAX.☆24Feb 20, 2026Updated last month