AdepojuJeremy / CUDA-120-DAYS--CHALLENGE
A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Processors”). Features six capstone projects to solidify GPU parallel programming, memory management, and performance optimization skills.
☆667 · Updated last month
Alternatives and similar repositories for CUDA-120-DAYS--CHALLENGE:
Users who are interested in CUDA-120-DAYS--CHALLENGE are comparing it to the repositories listed below:
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast… ☆333 · Updated 2 months ago
- ☆296 · Updated 3 weeks ago
- Learnings and programs related to CUDA ☆380 · Updated 2 months ago
- 100 days of building GPU kernels! ☆399 · Updated last week
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code. ☆342 · Updated last month
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch ☆268 · Updated 5 months ago
- ☆247 · Updated 3 months ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa… ☆216 · Updated 4 months ago
- GPU Kernels ☆172 · Updated last week
- learningggggggg 🐳 ☆520 · Updated last month
- (WIP) A small but powerful, homemade PyTorch from scratch. ☆550 · Updated 2 weeks ago
- CUDA Learning guide ☆366 · Updated 10 months ago
- Assignments of courses taught at IISC as part of MTech AI curriculum ☆111 · Updated 2 months ago
- ☆80 · Updated 3 weeks ago
- An ML Systems Onboarding list ☆776 · Updated 3 months ago
- creating a tiny tensor library in raw C ☆680 · Updated 2 months ago
- Learning about CUDA by writing PTX code. ☆128 · Updated last year
- Question paper of courses taught at IISC as part of MTech AI curriculum ☆62 · Updated 5 months ago
- High Quality Resources on GPU Programming/Architecture ☆586 · Updated 9 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆180 · Updated last week
- Apply GPU in ML and DL ☆52 · Updated 2 months ago
- UNet diffusion model in pure CUDA ☆601 · Updated 10 months ago
- ☆1,083 · Updated 3 weeks ago
- High performance hybrid classical-quantum computing learning framework written in C ☆443 · Updated 3 months ago
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS ☆169 · Updated last month
- pytorch from scratch in pure C/CUDA and python ☆40 · Updated 6 months ago
- repo of paper implementations ☆19 · Updated 2 months ago
- Here's all my Python/Numba (CUDA) code for the encoder block I made :) ☆61 · Updated last week
- a simple CLI command that will create a template of a generic ML Project ☆79 · Updated 7 months ago
- a tiny multidimensional array implementation in C similar to numpy, but only one file. ☆227 · Updated 9 months ago