Building a GPT-like LLM from scratch with PyTorch.
☆353 · Dec 20, 2024 · Updated last year
Alternatives and similar repositories for Build-a-Large-Language-Model-from-Scratch
Users who are interested in Build-a-Large-Language-Model-from-Scratch are comparing it to the libraries listed below.
- ☆18 · Feb 27, 2025 · Updated last year
- An introduction to DSPy ☆34 · Aug 30, 2025 · Updated 9 months ago
- Implementations of papers that I read; you can read my breakdown in my blog ☆92 · Oct 23, 2025 · Updated 7 months ago
- This project will help you get started with using the Neurosity Crown SDK with OpenAI ☆12 · Aug 5, 2024 · Updated last year
- A much better Research Agent 🎓 ☆41 · Feb 18, 2026 · Updated 3 months ago
- Repository of implementations of classic and SOTA RL algorithms from scratch in PyTorch ☆224 · Mar 19, 2026 · Updated 2 months ago
- ☆33 · Jul 27, 2025 · Updated 10 months ago
- Building LLMs from scratch following the book from S. Raschka ☆34 · Mar 27, 2025 · Updated last year
- Context Engineering Course with DSPy ☆223 · Jul 27, 2025 · Updated 10 months ago
- A Django-based web application that simplifies exam lifecycle management from creation to grading, integrating OCR and AI for an automate… ☆12 · Jul 7, 2024 · Updated last year
- LLMs represent numbers on a helix and manipulate that helix to do addition. ☆30 · Feb 4, 2025 · Updated last year
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models" ☆26,885 · Apr 24, 2026 · Updated last month
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 · Mar 12, 2024 · Updated 2 years ago
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT2 (~95M params). Fast, creative text generation tra… ☆17 · Apr 17, 2026 · Updated last month
- Toy distributed PostgreSQL by implementing SQL over KV ☆11 · Jan 14, 2026 · Updated 5 months ago
- Use AI to instantly summarize websites' terms of service and highlight any concerning elements ☆17 · Apr 5, 2025 · Updated last year
- UM1 test programs and sample code ☆11 · Jul 25, 2022 · Updated 3 years ago
- A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gp… ☆17 · Mar 11, 2025 · Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism" ☆15 · Apr 30, 2025 · Updated last year
- Easy to use Verifiable AI and smart contracts interoperability. ☆31 · Jun 28, 2024 · Updated last year
- Manage your Chrome tab overload in Markdown ☆76 · Dec 29, 2025 · Updated 5 months ago
- Generative AI with Python and PyTorch, Second Edition - Published by Packt ☆219 · Jan 18, 2026 · Updated 4 months ago
- Free and open-source curriculum to master artificial intelligence ☆36 · Feb 28, 2025 · Updated last year
- ☆24 · Apr 24, 2025 · Updated last year
- chat-with-docs ☆20 · Nov 28, 2024 · Updated last year
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in PyTorch ☆419 · Nov 11, 2025 · Updated 7 months ago
- Knowledge Graph Generator app ☆35 · Apr 18, 2024 · Updated 2 years ago
- Generate high-quality articles for your blog using a SERP workflow and AI ☆297 · Dec 22, 2024 · Updated last year
- Grokking on modular arithmetic in less than 150 epochs in MLX ☆15 · Oct 24, 2024 · Updated last year
- A WordPress plugin starter template for coding with AI IDEs like Augment Code, Cursor, Windsurf, Loveable, Bolt, Cline, Roo Code, etc. ☆15 · Updated this week
- Materials for the Ultimate Hybrid Search Workshop ☆46 · Updated this week
- Directory of the most valuable AI content creators on YouTube, categorized by specialty. ☆22 · Feb 26, 2025 · Updated last year
- KL3M training data collection and preprocessing ☆22 · Apr 14, 2025 · Updated last year
- A couple scripts to grab stats from email ☆43 · Sep 10, 2024 · Updated last year
- A curated list of Large Language Model resources, covering model training, serving, fine-tuning, and building LLM applications. ☆4,956 · Aug 18, 2025 · Updated 9 months ago
- eSNN - Learning similarity measure from data ☆12 · Nov 28, 2019 · Updated 6 years ago
- 🧡 Hacker News summaries ☆22 · Apr 10, 2024 · Updated 2 years ago
- ☆13 · Mar 16, 2025 · Updated last year
- META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Exe… ☆291 · Apr 30, 2026 · Updated last month