rkinas / cuda-learningLinks
This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or looking to optimize and scale your GPU-accelerated applications.
☆354Updated 4 months ago
Alternatives and similar repositories for cuda-learning
Users that are interested in cuda-learning are comparing it to the libraries listed below
Sorting:
- ☆343Updated 2 months ago
- GPU Kernels☆182Updated last month
- 100 days of building GPU kernels!☆445Updated last month
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆364Updated 3 months ago
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…☆697Updated 2 months ago
- Learnings and programs related to CUDA☆407Updated 4 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆185Updated 3 weeks ago
- An ML Systems Onboarding list☆816Updated 5 months ago
- ☆265Updated 5 months ago
- Apply GPU in ML and DL☆52Updated 4 months ago
- ☆174Updated 5 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆271Updated 7 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆555Updated this week
- CUDA tutorials for Maths & ML tutorials with examples, covers multi-gpus, fused attention, winograd convolution, reinforcement learning.☆182Updated 2 weeks ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆218Updated 5 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆30Updated this week
- The Tensor (or Array)☆436Updated 10 months ago
- ☆1,196Updated 2 months ago
- ☆161Updated this week
- Simple MPI implementation for prototyping or learning☆248Updated 3 weeks ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆281Updated last week
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS☆189Updated last month
- learningggggggg 🐳☆526Updated 2 months ago
- High Quality Resources on GPU Programming/Architecture☆588Updated 10 months ago
- Alex Krizhevsky's original code from Google Code☆192Updated 9 years ago
- making the official triton tutorials actually comprehensible☆41Updated 3 months ago
- Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)☆150Updated last year
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆185Updated last year
- Some CUDA example code with READMEs.☆165Updated 3 months ago
- UNet diffusion model in pure CUDA☆608Updated 11 months ago