Building a GPT-like LLM from scratch with PyTorch.
☆336Dec 20, 2024Updated last year
Alternatives and similar repositories for Build-a-Large-Language-Model-from-Scratch
Users that are interested in Build-a-Large-Language-Model-from-Scratch are comparing it to the libraries listed below
Sorting:
- An introduction to DSPy☆34Aug 30, 2025Updated 6 months ago
- ☆18Feb 27, 2025Updated last year
- ☆10Apr 10, 2014Updated 11 years ago
- Codebase for Cuda Learning☆31Jul 13, 2024Updated last year
- Code, figures, and supplementary materials for the paper "A Harmonic Field Model of Consciousness in the Human Brain". Includes Python s…☆21Dec 1, 2025Updated 3 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Mar 12, 2024Updated last year
- Multi-Agent Blog Generator based on Agno framework. Supports leading LLM providers like OpenAI, Gemini, Claude, and Grok.☆73Jan 6, 2026Updated last month
- ☆63Nov 23, 2025Updated 3 months ago
- An app that extracts your twitter threads into a downloadable CSV file.☆12Apr 8, 2023Updated 2 years ago
- UM1 test programs and sample code☆11Jul 25, 2022Updated 3 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆14Apr 30, 2025Updated 10 months ago
- Toy distributed PostgreSQL by implementing SQL over KV☆11Jan 14, 2026Updated last month
- This project will help you get started with using the Neurosity Crown sdk with Open AI☆12Aug 5, 2024Updated last year
- Implementations of Papers that I read, you can read my breakdown in my blog☆88Oct 23, 2025Updated 4 months ago
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆86,149Feb 19, 2026Updated last week
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT-MoE (~85M params), TinyGPT2 (~95M params). Fast, c…☆15Feb 21, 2026Updated last week
- ☆14Jan 14, 2025Updated last year
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic…☆14Nov 22, 2023Updated 2 years ago
- Context Engineering Course with DSPy☆215Jul 27, 2025Updated 7 months ago
- ☆22Aug 27, 2025Updated 6 months ago
- Python and C++ implementation of the Chirp-Z transform☆19Aug 11, 2020Updated 5 years ago
- Building LLMs from scratch following the book from S. Raschka☆33Mar 27, 2025Updated 11 months ago
- Use AI to instantly summarize websites' terms of service and highlight any concerning elements☆17Apr 5, 2025Updated 10 months ago
- Serverless ML Course for building AI-enabled Prediction Services from models and features☆15Oct 19, 2022Updated 3 years ago
- empirically chooses -ngl param for llama.cpp☆17Mar 19, 2025Updated 11 months ago
- Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…☆14May 4, 2024Updated last year
- ☆15May 29, 2022Updated 3 years ago
- A Gentle Principled Introduction to Deep Reinforcement Learning☆19Apr 4, 2025Updated 11 months ago
- Playing around with video object detection using DNN☆19Jan 14, 2021Updated 5 years ago
- Python implementation of the context tree weighting (CTW) method for sequential probability assignment.☆19Sep 1, 2022Updated 3 years ago
- ☆42Apr 30, 2024Updated last year
- ☆33Jul 27, 2025Updated 7 months ago
- ☆24Apr 24, 2025Updated 10 months ago
- 🧡 Hacker News summaries☆22Apr 10, 2024Updated last year
- A virtual agent for your virtual books📚☆48May 18, 2025Updated 9 months ago
- chat-with-docs☆19Nov 28, 2024Updated last year
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆23,193Dec 17, 2025Updated 2 months ago
- llms.txt -> MCP converter and other tools for the adoption of the `llms.txt` standard☆48Jan 12, 2026Updated last month
- This project allows you to plug in a GitHub repository URL, generate vectors for a LLM and use ChatGPT models to interact. The main frame…☆19Jun 4, 2023Updated 2 years ago