High Quality Resources on GPU Programming/Architecture
☆592Jul 26, 2024Updated last year
Alternatives and similar repositories for gpu-alpha
Users that are interested in gpu-alpha are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- From the Tensor to Stable Diffusion, a rough outline for a 10 week course.☆1,082Apr 5, 2026Updated 2 months ago
- An ML Systems Onboarding list☆1,083Feb 19, 2026Updated 3 months ago
- This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames☆23Apr 7, 2024Updated 2 years ago
- Simple Transformer in Jax☆144Jun 22, 2024Updated last year
- Following Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Jul 31, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- some books and papers and stuff☆15Sep 25, 2024Updated last year
- GPU programming related news and material links☆2,162Mar 8, 2026Updated 3 months ago
- Learnings and programs related to CUDA☆437Jun 29, 2025Updated 11 months ago
- From the Transistor to the Web Browser, a rough outline for a 12 week course☆6,515Oct 12, 2021Updated 4 years ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆276Nov 21, 2024Updated last year
- UNet diffusion model in pure CUDA☆659Jun 28, 2024Updated last year
- Solve puzzles. Learn CUDA.☆12,212Sep 1, 2024Updated last year
- i will automate factorio☆115Jul 31, 2024Updated last year
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Sep 6, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- learningggggggg 🐳☆620Apr 2, 2025Updated last year
- Machine Learning Engineering Open Book☆18,056May 18, 2026Updated 3 weeks ago
- A minimal GPU design in Verilog to learn how GPUs work from the ground up☆12,530Aug 18, 2024Updated last year
- papers.day☆93Dec 15, 2023Updated 2 years ago
- Personal solutions to the Triton Puzzles☆21Jul 18, 2024Updated last year
- llama3 implementation one matrix multiplication at a time☆15,231May 23, 2024Updated 2 years ago
- Tutorials on tinygrad☆478Oct 10, 2025Updated 8 months ago
- LLM101n: Let's build a Storyteller☆37,259Aug 1, 2024Updated last year
- A really tiny autograd engine☆100May 26, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆33,035Updated this week
- A zero-dependency ML framework in C with a modern Python API for full control over execution and memory.☆687Updated this week
- Puzzles for learning Triton☆2,471Apr 1, 2026Updated 2 months ago
- infinifi plays gentle lofi music in the background indefinitely☆344Nov 26, 2024Updated last year
- Blazingly fast neighborhood attention☆14Nov 28, 2023Updated 2 years ago
- Python tools☆14Oct 22, 2023Updated 2 years ago
- parallelized hyperdimensional tictactoe☆127Aug 25, 2024Updated last year
- A deep-dive on the entire history of deep-learning☆1,554Jul 16, 2024Updated last year
- LLM training in simple, raw C/CUDA☆30,150Jun 26, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This is a small autograd engine, made purely from numpy and python.☆27Sep 17, 2024Updated last year
- learning & making kernels in cuda / triton☆22Aug 24, 2025Updated 9 months ago
- Let's make sand talk☆590Oct 17, 2023Updated 2 years ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆74Apr 22, 2025Updated last year
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆228Jan 2, 2025Updated last year
- could we make an ml stack in 100,000 lines of code?☆46Jul 17, 2024Updated last year
- Cerule - A Tiny Mighty Vision Model☆70Nov 9, 2025Updated 7 months ago