Building a GPT-like LLM from scratch with PyTorch.
☆347Dec 20, 2024Updated last year
Alternatives and similar repositories for Build-a-Large-Language-Model-from-Scratch
Users that are interested in Build-a-Large-Language-Model-from-Scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆90,284Apr 6, 2026Updated last week
- Code, figures, and supplementary materials for the paper "A Harmonic Field Model of Consciousness in the Human Brain". Includes Python s…☆21Dec 1, 2025Updated 4 months ago
- ☆18Feb 27, 2025Updated last year
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆204May 12, 2024Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆60Oct 18, 2025Updated 5 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Multi-Agent AI system that automatically retrieves the latest Football news and creates a newsletter☆28Apr 16, 2024Updated last year
- A much better Research Agent 🎓☆41Feb 18, 2026Updated last month
- A single-threaded event-driven cache☆19Oct 7, 2024Updated last year
- Repository of implementations of classic and sota rl algorithms from scratch in PyTorch☆222Mar 19, 2026Updated 3 weeks ago
- Chain MiniMax Speech + Nano Banana Pro + Wan 2.6 to generate videos from script segments. Built for the official Wan 2.6 release with fal…☆25Dec 19, 2025Updated 3 months ago
- ☆33Jul 27, 2025Updated 8 months ago
- WindowTitleEx shows the HWND, thread ID and process in Windows titles. Tray icon to remove these extra is included.☆20Mar 15, 2020Updated 6 years ago
- Typescript implementation of Relaxed Radix Balanced Trees☆19Sep 15, 2024Updated last year
- Context Engineering Course with DSPy☆219Jul 27, 2025Updated 8 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Project code for training LLMs to write better unit tests + code☆21May 19, 2025Updated 10 months ago
- The A+ programming language from Morgan Stanley☆41Mar 21, 2014Updated 12 years ago
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆25,024Dec 17, 2025Updated 3 months ago
- ☆14Jan 14, 2025Updated last year
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆29Feb 4, 2025Updated last year
- Serverless ML Course for building AI-enabled Prediction Services from models and features☆15Oct 19, 2022Updated 3 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated 11 months ago
- Easy to use Verifiable AI and smart contracts interoperability.☆31Jun 28, 2024Updated last year
- A 4-hour coding workshop to understand how LLMs are implemented and used☆1,082Jan 13, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Microservice for user authentication, authorization based on JWT mechanism with role-based access control. Project implement Event Driven…☆28May 15, 2025Updated 10 months ago
- ☆10Apr 10, 2014Updated 12 years ago
- Free and open-source curriculum to master artificial intelligence☆35Feb 28, 2025Updated last year
- Save & share Bard conversations. Discover & use Bard prompts. Enhance Bard with more features.☆13May 26, 2023Updated 2 years ago
- ☆24Apr 24, 2025Updated 11 months ago
- An application built on the Model Context Protocol (MCP) that transforms any website into highly relevant content based on your queries. …☆63Apr 18, 2025Updated 11 months ago
- chat-with-docs☆19Nov 28, 2024Updated last year
- ☆42Apr 30, 2024Updated last year
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆413Nov 11, 2025Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Multi-agent system for booking appointments and generating PDF invoices☆13Jul 16, 2025Updated 8 months ago
- ☆21Jul 15, 2025Updated 8 months ago
- Knowledge Graph Generator app☆34Apr 18, 2024Updated last year
- Generate high-quality articles for your blog using a SERP workflow and AI☆295Dec 22, 2024Updated last year
- An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.☆56Jan 23, 2026Updated 2 months ago
- Mentis: A powerful multi-agent orchestration framework built on LangGraph.☆294May 16, 2025Updated 10 months ago
- A WordPress plugin starter template for coding with AI IDEs, like; Augment Code, Cursor, Windsurf, Loveable, Bolt, Cline, Roo Code, etc☆15Mar 26, 2026Updated 2 weeks ago