It is said that, Ilya Sutskever gave John Carmack this reading list of ~ 30 research papers on deep learning.
☆1,433Jun 4, 2024Updated last year
Alternatives and similar repositories for ilya-sutskever-recommended-reading
Users that are interested in ilya-sutskever-recommended-reading are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM101n: Let's build a Storyteller☆36,559Aug 1, 2024Updated last year
- Free and open-source curriculum to master artificial intelligence☆35Feb 28, 2025Updated last year
- Real-Time equilibrium reconstruction code☆15Mar 20, 2026Updated last week
- ☆1,087Nov 3, 2025Updated 4 months ago
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆89,206Updated this week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆5,787Mar 20, 2026Updated last week
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆77,102Feb 5, 2026Updated last month
- Implement a reasoning LLM in PyTorch from scratch, step by step☆3,620Mar 20, 2026Updated last week
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆18Feb 9, 2026Updated last month
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆24,315Dec 17, 2025Updated 3 months ago
- Rust Solana IDL types definitions de/serializable with serde☆17Jul 17, 2024Updated last year
- Complete solutions to the Programming Massively Parallel Processors Edition 4☆690Jun 18, 2025Updated 9 months ago
- Quick script to convert a git repo into a prompt for analyzing with an LLM☆27Jun 22, 2024Updated last year
- [WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)☆14,356Feb 12, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This repo has scripts to compare various powerful RL methods☆39Feb 23, 2026Updated last month
- A tiny deep learning library written in Java☆27Feb 12, 2023Updated 3 years ago
- Neural Networks: Zero to Hero☆21,025Aug 18, 2024Updated last year
- rl from zero pretrain, can it be done? yes.☆290Sep 28, 2025Updated 5 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆275Nov 21, 2024Updated last year
- LLM training in simple, raw C/CUDA☆29,216Jun 26, 2025Updated 9 months ago
- just me trying to implement deep learning concepts in code☆214Nov 8, 2025Updated 4 months ago
- ML from scratch☆2,446Aug 12, 2025Updated 7 months ago
- Machine Learning Systems☆22,933Updated this week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆55,432Nov 12, 2025Updated 4 months ago
- Question paper of courses taught at IISC as part of MTech AI curriculum☆108Dec 1, 2024Updated last year
- ☆18Updated this week
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API☆15,147Aug 8, 2024Updated last year
- NanoGPT (124M) in 2 minutes☆5,003Mar 17, 2026Updated last week
- Assignments of courses taught at IISC as part of MTech AI curriculum☆141Feb 15, 2025Updated last year
- Machine Learning Engineering Open Book☆17,528Mar 16, 2026Updated last week
- negate_sentence(A Python module that doesn't negate sentences.)☆31Oct 13, 2024Updated last year
- Official code for PLoP☆17Mar 6, 2026Updated 2 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆35Aug 28, 2025Updated 6 months ago
- Master PostgreSQL Internals: A comprehensive deep-dive from SQL parsing to disk I/O. Covers MVCC, WAL, VACUUM, Indexing, and Query Execut…☆47Jan 28, 2026Updated last month
- Temporal Pathway Synthesizer☆17Jun 28, 2024Updated last year
- Minimal reproduction of DeepSeek R1-Zero☆12,963Feb 27, 2026Updated last month
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.☆6,903Dec 17, 2025Updated 3 months ago
- Research papers and blogs to transition to AI Engineering☆2,379Nov 19, 2025Updated 4 months ago
- llama3 implementation one matrix multiplication at a time☆15,255May 23, 2024Updated last year