This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or looking to optimize and scale your GPU-accelerated applications.
☆447Feb 22, 2025Updated last year
Alternatives and similar repositories for cuda-learning
Users that are interested in cuda-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆470Mar 10, 2025Updated last year
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…☆900Mar 29, 2025Updated last year
- GPU Kernels☆223Apr 27, 2025Updated 11 months ago
- 100 days of building GPU kernels!☆587Apr 27, 2025Updated 11 months ago
- ☆425Apr 10, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆46May 24, 2025Updated 10 months ago
- This repository documents my 100-day journey of learning and writing CUDA kernels.☆28Mar 29, 2026Updated 2 weeks ago
- ☆3,490Mar 11, 2026Updated last month
- Learning records for building a large language model from scratch☆59Jan 1, 2025Updated last year
- Building GPT ...☆18Dec 1, 2024Updated last year
- Apply GPU in ML and DL☆67Mar 23, 2026Updated 3 weeks ago
- ☆15Feb 23, 2025Updated last year
- ☆11Aug 4, 2025Updated 8 months ago
- Package for data-driven and phenomenological gravitational waveform models☆12Mar 27, 2026Updated 2 weeks ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆146Mar 30, 2026Updated 2 weeks ago
- Puzzles for learning Triton☆2,359Apr 1, 2026Updated 2 weeks ago
- GPU Programming with C++ and CUDA, published by Packt☆77Dec 15, 2025Updated 3 months ago
- Mini CCL - A lightweight collective communication library☆31Jan 2, 2026Updated 3 months ago
- Learnings and programs related to CUDA☆437Jun 29, 2025Updated 9 months ago
- Astronomy Research + JAX Meeting 2024☆13Oct 23, 2024Updated last year
- Write a fast kernel and see how you compare against the best humans and AI on gpumode.com☆90Updated this week
- ☆14Mar 29, 2026Updated 2 weeks ago
- Personal solutions to the Triton Puzzles☆20Jul 18, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- JAX-accelerated nuclear equation of state inference and TOV solvers☆10Updated this week
- learningggggggg 🐳☆616Apr 2, 2025Updated last year
- Class of High Performance Computing taken at U.T.P 2017☆125Oct 11, 2017Updated 8 years ago
- ☆248Jan 2, 2025Updated last year
- Material for gpu-mode lectures☆5,945Feb 1, 2026Updated 2 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,146Aug 26, 2025Updated 7 months ago
- A pqSNARK with lightweight proofs, powered by the Whir PCS.☆45Sep 11, 2025Updated 7 months ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Apr 1, 2026Updated last week
- Make triton easier☆50Jun 12, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- GPU programming related news and material links☆2,093Mar 8, 2026Updated last month
- ☆12Nov 10, 2023Updated 2 years ago
- Port of Karpathy's micrograd in pure C. Micrograd is a tiny scalar-valued autograd engine and a neural net library on top of it with PyTo…☆35Jul 27, 2024Updated last year
- ☆469Dec 18, 2025Updated 3 months ago
- A Mathematica package for the numerical solution of ODE eigenvalue problems via a pseudospectral method using the Bernstein basis.☆13Oct 8, 2022Updated 3 years ago
- A repository containing deep learning models and evaluation methods for enhancing medical image segmentation in Computed Tomography (CT) …☆20Jan 20, 2024Updated 2 years ago
- Minimalistic, hackable PyTorch implementation of SimSiam in ~400 lines. Achieves good performance on ImageNet with ResNet50. Features dis…☆21Nov 25, 2024Updated last year