kevinpdev / gpt-from-scratch

Educational implementation of a small GPT model from scratch in a single Jupyter Notebook

☆104

Alternatives and similar repositories for gpt-from-scratch

Users who are interested in gpt-from-scratch are comparing it to the repositories listed below.

rodmarkun / SmolML
A fully functional and simple Machine Learning library made entirely from scratch with Python.
☆295 · Updated this week
ash80 / RLHF_in_notebooks
RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks
☆183 · Updated last month
FareedKhan-dev / train-llm-from-scratch
A straightforward method for training your LLM, from downloading data to generating text.
☆414 · Updated this week
OmuNaman / Machine-Learning-By-Hand
☆74 · Updated 2 months ago
AniruddhaChattopadhyay / Books
☆163 · Updated last month
MaxHastings / Kolo
The Fastest Way to Fine-Tune LLMs Locally
☆313 · Updated 4 months ago
ideaweaver-ai / Tiny-Children-Stories-30M-model
☆113 · Updated last month
ideaweaver-ai / DeepSeek-Children-Stories-15M-model
☆93 · Updated last month
liyuan24 / nanoDeepResearch
A Deep Research agent from scratch
☆201 · Updated 2 months ago
therealoliver / Deepdive-llama3-from-scratch
Implement llama3 inference step by step: grasp the core concepts, master the derivation, and write the code.
☆604 · Updated 5 months ago
awjuliani / web-rl-playground
An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.
☆71 · Updated 2 months ago
souzatharsis / tamingLLMs
Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software
☆321 · Updated 6 months ago
Fus3n / gem-assist
Command-line personal assistant using your favorite proprietary or local models, with access to more than 30 tools
☆110 · Updated last month
muchlakshay / MLP-From-Scratch
A C++ implementation of a Multilayer Perceptron (MLP) neural network using Eigen, supporting multiple activation and loss functions, mini…
☆137 · Updated 2 months ago
kabir2505 / tiny-mixtral
☆43 · Updated 3 months ago
pixeltable / pixelagent
Pixelagent — Multimodal stateful agents
☆214 · Updated 2 months ago
codelion / adaptive-classifier
A flexible, adaptive classification system for dynamic text classification
☆353 · Updated 2 weeks ago
JohnMachado11 / Build-a-Large-Language-Model-from-Scratch
Building a GPT-like LLM from scratch with PyTorch.
☆274 · Updated 7 months ago
iluxu / llmbasedos
Minimal Linux OS with a Model Context Protocol (MCP) gateway to expose local capabilities to LLMs.
☆260 · Updated last month
bclarkson-code / Tricycle
Autograd to GPT-2 completely from scratch
☆115 · Updated 3 months ago
goyalpramod / Foundational-ML-papers
Implementations of papers that I read; you can read my breakdowns on my blog.
☆78 · Updated 2 weeks ago
MarioSieg / magnetron
(WIP) A small but powerful, homemade PyTorch from scratch.
☆558 · Updated this week
patrickloeber / llm-data-scrapers
A list of useful Open Source tools and scrapers to gather data for LLMs
☆237 · Updated 5 months ago
willkurt / token-explorer
A simple tool that lets you explore different possible paths that an LLM might sample.
☆180 · Updated 3 months ago
babycommando / neuralgraffiti
Live-bending a foundation model’s output at neural network level.
☆266 · Updated 4 months ago
SajiJohnMiranda / DoCoreAI
DoCoreAI is a next-gen open-source AI profiler that optimizes reasoning, creativity, precision and temperature in a single step—cutting t…
☆44 · Updated 2 months ago
ivanfioravanti / prompt-eng-ollama-interactive-tutorial
Ollama's Interactive Prompt Engineering Tutorial
☆250 · Updated 8 months ago
gabrielchasukjin / cloi
Local debugging agent that runs in your terminal
☆392 · Updated 2 months ago
shivendrra / SmallLanguageModel
An LLM cookbook for building your own model from scratch, all the way from gathering data to training a model
☆148 · Updated last year
FareedKhan-dev / gpt4o-from-scratch
Implementation of a GPT-4o-like multimodal model from scratch using Python
☆69 · Updated 4 months ago