AdepojuJeremy / CUDA-120-DAYS--CHALLENGELinks
A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Processors”). Features six capstone projects to solidify GPU parallel programming, memory management, and performance optimization skills.
☆836Updated 9 months ago
Alternatives and similar repositories for CUDA-120-DAYS--CHALLENGE
Users that are interested in CUDA-120-DAYS--CHALLENGE are comparing it to the libraries listed below
Sorting:
- Learnings and programs related to CUDA☆432Updated 6 months ago
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆431Updated 10 months ago
- Complete solutions to the Programming Massively Parallel Processors Edition 4☆619Updated 6 months ago
- ☆408Updated 8 months ago
- 100 days of building GPU kernels!☆560Updated 8 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆448Updated 9 months ago
- CUDA Learning guide☆502Updated last year
- GPU Kernels☆217Updated 8 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆664Updated last week
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆276Updated last year
- ☆427Updated 2 weeks ago
- An ML Systems Onboarding list☆960Updated 11 months ago
- learningggggggg 🐳☆563Updated 9 months ago
- Apply GPU in ML and DL☆55Updated 3 months ago
- ☆113Updated 3 weeks ago
- High Quality Resources on GPU Programming/Architecture☆589Updated last year
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆228Updated last year
- GPU programming related news and material links☆1,881Updated 3 months ago
- It is said that, Ilya Sutskever gave John Carmack this reading list of ~ 30 research papers on deep learning.☆1,080Updated last year
- UNet diffusion model in pure CUDA☆658Updated last year
- ☆878Updated this week
- Simple MPI implementation for prototyping or learning☆297Updated 5 months ago
- Some CUDA example code with READMEs.☆179Updated last month
- Tutorials on tinygrad☆448Updated 2 months ago
- Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)☆53Updated last year
- CUDA tutorials for Maths & ML tutorials with examples, covers multi-gpus, fused attention, winograd convolution, reinforcement learning.☆207Updated 6 months ago
- Learning about CUDA by writing PTX code.☆151Updated last year
- Visualization of cache-optimized matrix multiplication☆157Updated 9 months ago
- Canny edge detector implemented in CUDA C/C++☆27Updated 10 months ago
- Assignments of courses taught at IISC as part of MTech AI curriculum☆140Updated 10 months ago