tanishqkumar / beyond-nanogptLinks

Minimal and annotated implementations of key ideas from modern deep learning research.

☆1,084

Alternatives and similar repositories for beyond-nanogpt

Users that are interested in beyond-nanogpt are comparing it to the libraries listed below

Sorting:

natolambert / rlhf-book
Textbook on reinforcement learning from human feedback
☆1,147Updated 2 weeks ago
hkproj / 100-days-of-gpu
☆358Updated 3 months ago
Quentin-Anthony / torch-profiling-tutorial
☆447Updated 2 weeks ago
0xD4rky / Vision-Transformers
This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…
☆228Updated 7 months ago
MarioSieg / magnetron
(WIP) A small but powerful, homemade PyTorch from scratch.
☆558Updated this week
LambdaLabsML / distributed-training-guide
Best practices & guides on how to write distributed pytorch training code
☆463Updated 5 months ago
rkinas / triton-resources
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
☆383Updated 4 months ago
saurabhaloneai / History-of-Deep-Learning
learningggggggg 🐳
☆541Updated 4 months ago
YuvrajSingh-mist / Paper-Replications
A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch
☆318Updated 2 weeks ago
policy-gradient / GRPO-Zero
Implementing DeepSeek R1's GRPO algorithm from scratch
☆1,508Updated 3 months ago
rkinas / cuda-learning
This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…
☆363Updated 5 months ago
JUSTSUJAY / ML-Research-Papers
☆105Updated 11 months ago
huggingface / picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
☆1,644Updated 3 weeks ago
Maharshi-Pandya / cudacodes
Learnings and programs related to CUDA
☆414Updated last month
tugot17 / pmpp
Complete solutions to the Programming Massively Parallel Processors Edition 4
☆450Updated last month
EleutherAI / cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
☆809Updated last week
Exorust / TorchLeet
Leetcode for Pytorch
☆1,442Updated last week
AniruddhaChattopadhyay / Books
☆163Updated last month
1y33 / 100Days
GPU Kernels
☆191Updated 3 months ago
a-hamdi / GPU
100 days of building GPU kernels!
☆477Updated 3 months ago
xbresson / CS5242_2025
NUS CS5242 Neural Networks and Deep Learning, Xavier Bresson, 2025
☆396Updated 3 months ago
SkalskiP / vlms-zero-to-hero
This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.
☆1,113Updated 6 months ago
kmohan321 / LLMs
☆89Updated 4 months ago
dmarx / anthology-of-modern-ml
Collection of important articles to be treated as a textbook
☆787Updated 2 months ago
McGill-NLP / nano-aha-moment
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
☆512Updated 3 weeks ago
KellerJordan / modded-nanogpt
NanoGPT (124M) in 3 minutes
☆2,965Updated 2 weeks ago
Engineer1999 / A-Curated-List-of-ML-System-Design-Case-Studies
This repository contains a curated collection of 300+ case studies from over 80 companies, detailing practical applications and insights …
☆1,496Updated last week
VachanVY / Reinforcement-Learning
PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research …
☆159Updated last week
neubig / starter-repo
An example starter repo for Python projects
☆294Updated last month
Open-Deep-ML / DML-OpenProblem
☆465Updated 2 weeks ago