rkinas / cuda-learning
This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming, whether you're just starting or looking to optimize and scale your GPU-accelerated applications.
☆432Updated 10 months ago
Alternatives and similar repositories for cuda-learning
Users who are interested in cuda-learning are comparing it to the repositories listed below
- ☆409Updated 9 months ago
- 100 days of building GPU kernels!☆561Updated 8 months ago
- Complete solutions to Programming Massively Parallel Processors, 4th Edition☆635Updated 6 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆453Updated 10 months ago
- GPU Kernels☆218Updated 8 months ago
- Learnings and programs related to CUDA☆432Updated 6 months ago
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…☆844Updated 9 months ago
- ☆438Updated 3 weeks ago
- Apply GPU in ML and DL☆55Updated 3 months ago
- An ML Systems Onboarding list☆967Updated 11 months ago
- CUDA Learning guide☆514Updated last year
- Some CUDA example code with READMEs.☆179Updated 2 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆665Updated 2 weeks ago
- CUDA tutorials for maths and ML with examples, covering multi-GPU, fused attention, Winograd convolution, and reinforcement learning.☆206Updated 7 months ago
- ☆907Updated last week
- This repo has all the basic things you'll need in order to understand the complete vision transformer architecture and its various implementa…☆229Updated last year
- Learn CUDA with PyTorch☆176Updated 3 weeks ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆195Updated 7 months ago
- Simple MPI implementation for prototyping or learning☆299Updated 5 months ago
- ☆233Updated last year
- UNet diffusion model in pure CUDA☆662Updated last year
- Small autograd engine inspired by Karpathy's micrograd and PyTorch☆276Updated last year
- Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)☆161Updated last month
- GPU programming related news and material links☆1,906Updated 3 months ago
- ☆116Updated last month
- ☆88Updated 2 months ago
- ☆209Updated last year
- A curriculum for learning about GPU performance engineering, from scratch to what the frontier AI labs do☆169Updated this week
- ☆538Updated 5 months ago
- making the official triton tutorials actually comprehensible☆93Updated 4 months ago