A straightforward method for training your LLM, from downloading data to generating text.
☆524Aug 3, 2025Updated 7 months ago
Alternatives and similar repositories for train-llm-from-scratch
Users that are interested in train-llm-from-scratch are comparing it to the libraries listed below
Sorting:
- A Straightforward, Step-by-Step Implementation of a Video Diffusion Model☆77Aug 18, 2025Updated 6 months ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆202May 12, 2024Updated last year
- Building DeepSeek R1 from Scratch☆748Mar 21, 2025Updated 11 months ago
- Encountering 14 different Naive RAG fails and using KG to solve it☆21Dec 4, 2025Updated 3 months ago
- A straightforward explanation of how DeepSeek R1 works☆18Feb 7, 2025Updated last year
- Train a 29M parameter GPT from Scratch☆33Mar 4, 2025Updated last year
- An automated Python tool that uses LLMs and internet to automatically fix your code until it runs perfectly.☆31Jan 22, 2025Updated last year
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆204Aug 23, 2024Updated last year
- Maximizing the Performance of a Simple RAG using RL☆90Mar 20, 2025Updated 11 months ago
- a feedforward neural network from scratch☆10Aug 5, 2024Updated last year
- Implementation of 12 AI agents evaluation techniques☆36Jul 31, 2025Updated 7 months ago
- Building LLaMA 4 MoE from Scratch☆72Apr 15, 2025Updated 10 months ago
- ☆15Nov 4, 2024Updated last year
- DSA and Visualizations for various sorting algorithms☆15Apr 9, 2025Updated 11 months ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆111Mar 31, 2025Updated 11 months ago
- Building a Multi-Agent AI System with LangGraph and LangSmith☆284May 31, 2025Updated 9 months ago
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆29Jan 13, 2026Updated last month
- Easily turn your Python functions into GUI applications☆98Dec 27, 2025Updated 2 months ago
- This project is a template for a Python package.☆34Nov 24, 2025Updated 3 months ago
- Handling Big Data with Knowledge Graph: A Detailed Guide☆29May 11, 2025Updated 9 months ago
- An end-to-end pipeline to optimize and host LLM for 100K parallel queries☆36Jul 6, 2025Updated 8 months ago
- ☆11Jan 12, 2017Updated 9 years ago
- Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023)☆14Nov 25, 2024Updated last year
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆78Apr 4, 2025Updated 11 months ago
- ☆29Nov 9, 2025Updated 4 months ago
- Implementation of all RL algorithms in a simpler way☆1,407Aug 29, 2025Updated 6 months ago
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 7 months ago
- A detail Implementation of handling long-term memory in Agentic AI☆38Oct 9, 2025Updated 5 months ago
- Creating Your Divine Agent 😇☆10Jan 26, 2026Updated last month
- Here I show how to use Deep Learning for biological and biomedical Data Integration.☆11Sep 17, 2020Updated 5 years ago
- Write Web API clients using annotations in python☆16Updated this week
- Run GEPA on your favorite non-python libraries.☆33Jan 22, 2026Updated last month
- ☆11Aug 3, 2024Updated last year
- Welcome to the LLM Tutorials and RAG Implementations repository! This repository provides tutorials, guides, and implementations for work…☆12Jul 1, 2025Updated 8 months ago
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch.☆12Jun 19, 2017Updated 8 years ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆24Jan 4, 2026Updated 2 months ago
- ☆527Feb 4, 2026Updated last month
- Toolkit for graph-relational data across space and time☆118Jan 28, 2026Updated last month
- Cross platform implementation of Minesweeper-like game written from scratch in C++ with SDL2, Dear ImGui and mINI.☆13Jan 28, 2026Updated last month