Building a GPT-like LLM from scratch with PyTorch.
☆339Dec 20, 2024Updated last year
Alternatives and similar repositories for Build-a-Large-Language-Model-from-Scratch
Users that are interested in Build-a-Large-Language-Model-from-Scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codebase for Cuda Learning☆31Jul 13, 2024Updated last year
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆88,603Mar 7, 2026Updated 2 weeks ago
- ☆18Feb 27, 2025Updated last year
- ☆63Nov 23, 2025Updated 4 months ago
- Implementations of Papers that I read, you can read my breakdown in my blog☆88Oct 23, 2025Updated 5 months ago
- This project will help you get started with using the Neurosity Crown sdk with Open AI☆12Aug 5, 2024Updated last year
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆201May 12, 2024Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Oct 18, 2025Updated 5 months ago
- 《Build a Large Language Model (From Scratch)》是一本深入探讨大语言模型原理与实现的电子书,适合希望深入了解 GPT 等大模型架构、训练过程及应用开发的学习者。为了让更多中文读者能够接触到这本极具价值的教材,我决定将其翻译成中文,并…☆3,426Sep 7, 2025Updated 6 months ago
- ☆22Aug 27, 2025Updated 6 months ago
- Repository of implementations of classic and sota rl algorithms from scratch in PyTorch☆221Jan 3, 2026Updated 2 months ago
- Tool that uses tavily and langraph to conduct research, then gsuite apis to organize it☆20Nov 17, 2024Updated last year
- Multi-Agent Blog Generator based on Agno framework. Supports leading LLM providers like OpenAI, Gemini, Claude, and Grok.☆73Jan 6, 2026Updated 2 months ago
- ☆33Jul 27, 2025Updated 7 months ago
- Typescript implementation of Relaxed Radix Balanced Trees☆19Sep 15, 2024Updated last year
- An app that extracts your twitter threads into a downloadable CSV file.☆13Apr 8, 2023Updated 2 years ago
- Context Engineering Course with DSPy☆216Jul 27, 2025Updated 7 months ago
- Project code for training LLMs to write better unit tests + code☆21May 19, 2025Updated 10 months ago
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT2 (~95M params). Fast, creative text generation tra…☆16Mar 9, 2026Updated 2 weeks ago
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆24,315Dec 17, 2025Updated 3 months ago
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆29Feb 4, 2025Updated last year
- Serverless ML Course for building AI-enabled Prediction Services from models and features☆15Oct 19, 2022Updated 3 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Mar 12, 2024Updated 2 years ago
- UM1 test programs and sample code☆11Jul 25, 2022Updated 3 years ago
- Use AI to instantly summarize websites' terms of service and highlight any concerning elements☆17Apr 5, 2025Updated 11 months ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated 10 months ago
- A 4-hour coding workshop to understand how LLMs are implemented and used☆1,076Jan 13, 2025Updated last year
- The fastest HTML builder engine for Crystal☆17Mar 4, 2026Updated 2 weeks ago
- Free and open-source curriculum to master artificial intelligence☆35Feb 28, 2025Updated last year
- chat-with-docs☆19Nov 28, 2024Updated last year
- ☆93Nov 11, 2025Updated 4 months ago
- Four-tier memory architecture for OpenCode AI agents: persistent core memory, session working memory, smart pruning, and pressure monitor…☆46Feb 19, 2026Updated last month
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆410Nov 11, 2025Updated 4 months ago
- code for Towards Data Science article on prompt-loss-weight☆11Jun 4, 2025Updated 9 months ago
- Generate high-quality articles for your blog using a SERP workflow and AI☆296Dec 22, 2024Updated last year
- Minute-long video generation at 24FPS.☆59Feb 2, 2026Updated last month
- An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.☆52Jan 23, 2026Updated 2 months ago
- Machine bootstrapping tool with a focus on sensible defaults, conventions, and avoidance of vendoring☆24Feb 1, 2026Updated last month
- 畳み込みニューラルネットワークをリアルタイムにビジュアル化するサイト☆29Dec 7, 2023Updated 2 years ago