AdepojuJeremy / CUDA-120-DAYS--CHALLENGELinks
A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Processors”). Features six capstone projects to solidify GPU parallel programming, memory management, and performance optimization skills.
☆724Updated 4 months ago
Alternatives and similar repositories for CUDA-120-DAYS--CHALLENGE
Users that are interested in CUDA-120-DAYS--CHALLENGE are comparing it to the libraries listed below
Sorting:
- Learnings and programs related to CUDA☆414Updated last month
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆363Updated 5 months ago
- ☆358Updated 3 months ago
- Complete solutions to the Programming Massively Parallel Processors Edition 4☆450Updated last month
- (WIP) A small but powerful, homemade PyTorch from scratch.☆558Updated this week
- 100 days of building GPU kernels!☆477Updated 3 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆383Updated 4 months ago
- ☆287Updated 6 months ago
- GPU Kernels☆191Updated 3 months ago
- CUDA Learning guide☆419Updated last year
- Apply GPU in ML and DL☆52Updated 5 months ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆228Updated 7 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆274Updated 8 months ago
- learningggggggg 🐳☆541Updated 4 months ago
- ☆1,334Updated last month
- An ML Systems Onboarding list☆849Updated 6 months ago
- High Quality Resources on GPU Programming/Architecture☆588Updated last year
- Some CUDA example code with READMEs.☆169Updated 5 months ago
- ☆163Updated last month
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆318Updated 2 weeks ago
- creating a tiny tensor library in raw C☆734Updated 5 months ago
- Canny edge detector implemented in CUDA C/C++☆27Updated 5 months ago
- ☆59Updated last week
- Learning about CUDA by writing PTX code.☆133Updated last year
- Simple MPI implementation for prototyping or learning☆272Updated 2 weeks ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆188Updated 2 months ago
- repo of paper implementations☆20Updated 5 months ago
- Assignments of courses taught at IISC as part of MTech AI curriculum☆120Updated 5 months ago
- Question paper of courses taught at IISC as part of MTech AI curriculum☆69Updated 8 months ago
- GPU programming related news and material links☆1,648Updated 7 months ago