tanishqkumar / beyond-nanogptLinks
Minimal and annotated implementations of key ideas from modern deep learning research.
☆1,133Updated 2 months ago
Alternatives and similar repositories for beyond-nanogpt
Users that are interested in beyond-nanogpt are comparing it to the libraries listed below
Sorting:
- Textbook on reinforcement learning from human feedback☆1,221Updated this week
- ☆495Updated last month
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆813Updated last month
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆362Updated this week
- Best practices & guides on how to write distributed pytorch training code☆475Updated 6 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,561Updated 4 months ago
- ☆366Updated 5 months ago
- Complete solutions to the Programming Massively Parallel Processors Edition 4☆508Updated 2 months ago
- It is said that, Ilya Sutskever gave John Carmack this reading list of ~ 30 research papers on deep learning.☆612Updated last year
- NUS CS5242 Neural Networks and Deep Learning, Xavier Bresson, 2025☆401Updated 4 months ago
- Learnings and programs related to CUDA☆418Updated 2 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆638Updated this week
- Leetcode for Pytorch☆1,544Updated last month
- ☆254Updated last month
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆227Updated 8 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆405Updated 6 months ago
- PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research …☆177Updated last month
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,720Updated 3 weeks ago
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆380Updated 6 months ago
- learningggggggg 🐳☆547Updated 5 months ago
- NanoGPT (124M) in 3 minutes☆3,117Updated last month
- 100 days of building GPU kernels!☆494Updated 4 months ago
- This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.☆1,126Updated 7 months ago
- Implementation of all RL algorithms in a simpler way☆1,111Updated 2 weeks ago
- GPU Kernels☆193Updated 4 months ago
- Collection of important articles to be treated as a textbook☆804Updated 2 weeks ago
- ☆484Updated last week
- ☆1,425Updated 7 months ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,026Updated this week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,568Updated 8 months ago