rkinas / cuda-learning
This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or looking to optimize and scale your GPU-accelerated applications.
☆336Updated 2 months ago
Alternatives and similar repositories for cuda-learning
Users that are interested in cuda-learning are comparing it to the libraries listed below
Sorting:
- 100 days of building GPU kernels!☆414Updated 2 weeks ago
- ☆309Updated last month
- GPU Kernels☆174Updated 2 weeks ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆345Updated 2 months ago
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…☆671Updated last month
- Learnings and programs related to CUDA☆396Updated 2 months ago
- Apply GPU in ML and DL☆52Updated 2 months ago
- An ML Systems Onboarding list☆780Updated 3 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆180Updated this week
- ☆249Updated 3 months ago
- ☆163Updated 4 months ago
- UNet diffusion model in pure CUDA☆602Updated 10 months ago
- CUDA Learning guide☆367Updated 10 months ago
- CUDA tutorials or Maths & ML tutorials with examples, covers multi-gpus, fused attention, winograd convolution, reinforcement learning.☆182Updated 3 weeks ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆268Updated 5 months ago
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS☆171Updated last week
- ☆1,098Updated last month
- The Tensor (or Array)☆432Updated 9 months ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆216Updated 4 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆550Updated this week
- GPU programming related news and material links☆1,501Updated 4 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,464Updated 2 months ago
- NVIDIA tools guide☆132Updated 4 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆193Updated 2 weeks ago
- Minimal and annotated implementations of key ideas from modern deep learning research.☆493Updated last week
- Slides, notes, and materials for the workshop☆325Updated 11 months ago
- High Quality Resources on GPU Programming/Architecture☆586Updated 9 months ago
- The Autograd Engine☆604Updated 8 months ago
- Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)☆150Updated 11 months ago
- Puzzles for learning Triton☆1,614Updated 5 months ago