AdepojuJeremy / CUDA-120-DAYS--CHALLENGE
A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Processors”). Features six capstone projects to solidify GPU parallel programming, memory management, and performance optimization skills.
☆646Updated 2 weeks ago
Alternatives and similar repositories for CUDA-120-DAYS--CHALLENGE:
Users that are interested in CUDA-120-DAYS--CHALLENGE are comparing it to the libraries listed below
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆315Updated last month
- Learnings and programs related to CUDA☆378Updated last month
- ☆232Updated last week
- 100 days of building GPU kernels!☆336Updated this week
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆332Updated last month
- GPU Kernels☆160Updated last week
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆251Updated 4 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆544Updated this week
- learningggggggg 🐳☆499Updated 2 weeks ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆214Updated 3 months ago
- ☆239Updated 2 months ago
- Apply GPU in ML and DL☆51Updated last month
- Learning about CUDA by writing PTX code.☆127Updated last year
- High Quality Resources on GPU Programming/Architecture☆585Updated 8 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆173Updated this week
- An ML Systems Onboarding list☆751Updated 2 months ago
- NUS CS5242 Neural Networks and Deep Learning, Xavier Bresson, 2025☆348Updated last week
- Some CUDA example code with READMEs.☆94Updated last month
- CUDA Learning guide☆357Updated 9 months ago
- UNet diffusion model in pure CUDA☆601Updated 9 months ago
- ☆1,027Updated 3 months ago
- a tiny multidimensional array implementation in C similar to numpy, but only one file.☆227Updated 8 months ago
- Assignments of courses taught at IISC as part of MTech AI curriculum☆91Updated 2 months ago
- Tutorials on tinygrad☆363Updated 3 weeks ago
- Question paper of courses taught at IISC as part of MTech AI curriculum☆61Updated 4 months ago
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS☆156Updated 3 weeks ago
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆347Updated last week
- Canny edge detector implemented in CUDA C/C++☆26Updated 2 months ago
- GPU programming related news and material links☆1,454Updated 3 months ago
- Low-Level Programming Roadmap and Resources☆1,029Updated last week