AdepojuJeremy / CUDA-120-DAYS--CHALLENGELinks
A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Processors”). Features six capstone projects to solidify GPU parallel programming, memory management, and performance optimization skills.
☆752Updated 5 months ago
Alternatives and similar repositories for CUDA-120-DAYS--CHALLENGE
Users that are interested in CUDA-120-DAYS--CHALLENGE are comparing it to the libraries listed below
Sorting:
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆382Updated 6 months ago
- Learnings and programs related to CUDA☆418Updated 2 months ago
- ☆367Updated 5 months ago
- 100 days of building GPU kernels!☆499Updated 4 months ago
- Complete solutions to the Programming Massively Parallel Processors Edition 4☆508Updated 3 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆408Updated 6 months ago
- CUDA Learning guide☆440Updated last year
- ☆307Updated last week
- GPU Kernels☆193Updated 4 months ago
- An ML Systems Onboarding list☆898Updated 7 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆638Updated last week
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆227Updated 8 months ago
- learningggggggg 🐳☆547Updated 5 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆276Updated 9 months ago
- ☆1,426Updated 2 months ago
- GPU programming related news and material links☆1,689Updated 8 months ago
- High Quality Resources on GPU Programming/Architecture☆588Updated last year
- Apply GPU in ML and DL☆53Updated 6 months ago
- It is said that, Ilya Sutskever gave John Carmack this reading list of ~ 30 research papers on deep learning.☆744Updated last year
- Some CUDA example code with READMEs.☆172Updated 6 months ago
- Assignments of courses taught at IISC as part of MTech AI curriculum☆127Updated 7 months ago
- Question paper of courses taught at IISC as part of MTech AI curriculum☆95Updated 9 months ago
- Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)☆53Updated last year
- Canny edge detector implemented in CUDA C/C++☆26Updated 7 months ago
- UNet diffusion model in pure CUDA☆648Updated last year
- ☆497Updated last month
- Visualization of cache-optimized matrix multiplication☆155Updated 6 months ago
- Simple MPI implementation for prototyping or learning☆279Updated last month
- ☆69Updated last week
- ☆256Updated last month