AdepojuJeremy / CUDA-120-DAYS--CHALLENGE
A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Processors”). Features six capstone projects to solidify GPU parallel programming, memory management, and performance optimization skills.
☆776 Updated 7 months ago
Alternatives and similar repositories for CUDA-120-DAYS--CHALLENGE
Users who are interested in CUDA-120-DAYS--CHALLENGE are comparing it to the repositories listed below.
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast… ☆403 Updated 8 months ago
- Learnings and programs related to CUDA ☆422 Updated 4 months ago
- ☆385 Updated 6 months ago
- Complete solutions to Programming Massively Parallel Processors, Edition 4 ☆556 Updated 4 months ago
- 100 days of building GPU kernels! ☆521 Updated 6 months ago
- CUDA Learning guide ☆461 Updated last year
- GPU Kernels ☆203 Updated 6 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code. ☆425 Updated 7 months ago
- ☆370 Updated last month
- (WIP) A small but powerful, homemade PyTorch from scratch. ☆643 Updated last week
- An ML Systems Onboarding list ☆917 Updated 9 months ago
- Small autograd engine inspired by Karpathy's micrograd and PyTorch ☆276 Updated 11 months ago
- Apply GPU in ML and DL ☆54 Updated last month
- ☆1,791 Updated 2 weeks ago
- This repo has all the basic things you'll need in order to understand the complete vision transformer architecture and its various implementa… ☆228 Updated 9 months ago
- learningggggggg 🐳 ☆552 Updated 6 months ago
- High Quality Resources on GPU Programming/Architecture ☆589 Updated last year
- UNet diffusion model in pure CUDA ☆651 Updated last year
- GPU programming related news and material links ☆1,746 Updated last month
- It is said that Ilya Sutskever gave John Carmack this reading list of ~30 research papers on deep learning. ☆886 Updated last year
- Some CUDA example code with READMEs. ☆176 Updated 7 months ago
- ☆88 Updated last week
- Simple MPI implementation for prototyping or learning ☆286 Updated 2 months ago
- ☆193 Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆194 Updated 4 months ago
- Learning about CUDA by writing PTX code. ☆145 Updated last year
- NVIDIA tools guide ☆144 Updated 9 months ago
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS ☆233 Updated 5 months ago
- Visualization of cache-optimized matrix multiplication ☆155 Updated 7 months ago
- The Tensor (or Array) ☆451 Updated last year