Building a GPT-like LLM from scratch with PyTorch.
☆355Dec 20, 2024Updated last year
Alternatives and similar repositories for Build-a-Large-Language-Model-from-Scratch
Users that are interested in Build-a-Large-Language-Model-from-Scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codebase for Cuda Learning☆36Jul 13, 2024Updated last year
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆98,270Jun 2, 2026Updated last month
- Code, figures, and supplementary materials for the paper "A Harmonic Field Model of Consciousness in the Human Brain". Includes Python s…☆21Dec 1, 2025Updated 7 months ago
- ☆18Feb 27, 2025Updated last year
- An introduction to DSPy☆33Aug 30, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆63Nov 23, 2025Updated 7 months ago
- Implementations of Papers that I read, you can read my breakdown in my blog☆91Oct 23, 2025Updated 8 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆60Oct 18, 2025Updated 8 months ago
- A much better Research Agent 🎓☆41Feb 18, 2026Updated 4 months ago
- ☆22Aug 27, 2025Updated 10 months ago
- Resources to development of data analytics path☆27Apr 20, 2026Updated 2 months ago
- Tool that uses tavily and langraph to conduct research, then gsuite apis to organize it☆20Nov 17, 2024Updated last year
- WindowTitleEx shows the HWND, thread ID and process in Windows titles. Tray icon to remove these extra is included.☆20Mar 15, 2020Updated 6 years ago
- PyTorch implementation of LARS (Layer-wise Adaptive Rate Scaling)☆20May 11, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Multi-Agent Blog Generator based on Agno framework. Supports leading LLM providers like OpenAI, Gemini, Claude, and Grok.☆74Jan 6, 2026Updated 5 months ago
- Building LLMs from scratch following the book from S. Raschka☆34Jun 25, 2026Updated last week
- An app that extracts your twitter threads into a downloadable CSV file.☆13Apr 8, 2023Updated 3 years ago
- Context Engineering Course with DSPy☆226Jul 27, 2025Updated 11 months ago
- A Django-based web application that simplifies exam lifecycle management from creation to grading, integrating OCR and AI for an automate…☆12Jul 7, 2024Updated last year
- A lightweight, cryptographically-authenticated UDP daemon for remote access, logging, and job control.☆24Apr 4, 2026Updated 3 months ago
- Serverless ML Course for building AI-enabled Prediction Services from models and features☆15Oct 19, 2022Updated 3 years ago
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆27,269Apr 24, 2026Updated 2 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Mar 12, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT2 (~95M params). Fast, creative text generation tra…☆17Jun 19, 2026Updated 2 weeks ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆16Apr 30, 2025Updated last year
- A 4-hour coding workshop to understand how LLMs are implemented and used☆1,099Jan 13, 2025Updated last year
- Microservice for user authentication, authorization based on JWT mechanism with role-based access control. Project implement Event Driven…☆29May 15, 2025Updated last year
- Generative AI with Python and PyTorch , Second Edition - Published by Packt☆221Jan 18, 2026Updated 5 months ago
- ☆24Apr 24, 2025Updated last year
- An application built on the Model Context Protocol (MCP) that transforms any website into highly relevant content based on your queries. …☆63Apr 18, 2025Updated last year
- chat-with-docs☆20Nov 28, 2024Updated last year
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆425Nov 11, 2025Updated 7 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Knowledge Graph Generator app☆35Apr 18, 2024Updated 2 years ago
- GreenMe- GenAI app - Reduce Carbon Footprint for Greener Future.☆16Jan 3, 2025Updated last year
- Generate high-quality articles for your blog using a SERP workflow and AI☆295Dec 22, 2024Updated last year
- An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.☆56Jan 23, 2026Updated 5 months ago
- I am learning from Hugging Face.☆22Feb 11, 2024Updated 2 years ago
- 畳み込みニューラルネットワークをリアルタイムにビジュアル化するサイト☆29Dec 7, 2023Updated 2 years ago
- Mentis: A powerful multi-agent orchestration framework built on LangGraph.☆296May 16, 2025Updated last year