jaymody/picoGPT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jaymody/picoGPT)

jaymody / picoGPT

An unnecessarily tiny implementation of GPT-2 in NumPy.

☆3,468

Alternatives and similar repositories for picoGPT

Users that are interested in picoGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

karpathy / minGPT
View on GitHub
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
☆24,731Aug 15, 2024Updated last year
karpathy / nanoGPT
View on GitHub
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆61,458Nov 12, 2025Updated 8 months ago
karpathy / llama2.c
View on GitHub
Inference Llama 2 in one file of pure C
☆19,751Aug 6, 2024Updated last year
FMInference / FlexLLMGen
View on GitHub
Running large language models on a single GPU for throughput-oriented scenarios.
☆9,363Oct 28, 2024Updated last year
tinygrad / tinygrad
View on GitHub
You like pytorch? You like micrograd? You love tinygrad! ❤️
☆33,324Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tloen / alpaca-lora
View on GitHub
Instruct-tune LLaMA on consumer hardware
☆18,912Jul 29, 2024Updated last year
LAION-AI / Open-Assistant
View on GitHub
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…
☆37,379Aug 17, 2024Updated last year
hpcaitech / ColossalAI
View on GitHub
Making large AI models cheaper, faster and more accessible
☆41,420Jul 13, 2026Updated last week
ggml-org / ggml
View on GitHub
Tensor library for machine learning
☆15,048Updated this week
Lightning-AI / lit-llama
View on GitHub
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…
☆6,082Jul 1, 2025Updated last year
tatsu-lab / stanford_alpaca
View on GitHub
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆30,250Jul 17, 2024Updated 2 years ago
BlinkDL / RWKV-LM
View on GitHub
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…
☆14,636Updated this week
lm-sys / FastChat
View on GitHub
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆39,500May 1, 2026Updated 2 months ago
meta-pytorch / gpt-fast
View on GitHub
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
☆6,229Aug 22, 2025Updated 11 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
karpathy / llm.c
View on GitHub
LLM training in simple, raw C/CUDA
☆30,624Jun 26, 2025Updated last year
Lightning-AI / litgpt
View on GitHub
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
☆13,493Updated this week
jzhang38 / TinyLlama
View on GitHub
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆9,017May 3, 2024Updated 2 years ago
ggml-org / llama.cpp
View on GitHub
LLM inference in C/C++
☆121,372Updated this week
karpathy / minbpe
View on GitHub
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
☆10,634Jul 1, 2024Updated 2 years ago
openai / gpt-2
View on GitHub
Code for the paper "Language Models are Unsupervised Multitask Learners"
☆25,015Aug 14, 2024Updated last year
deepspeedai / DeepSpeed
View on GitHub
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆42,787Updated this week
jax-ml / jax
View on GitHub
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
☆36,040Updated this week
databrickslabs / dolly
View on GitHub
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
☆10,807Jun 30, 2023Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
run-llama / llama_index
View on GitHub
LlamaIndex is the leading document agent and OCR platform
☆51,038Updated this week
meta-llama / llama
View on GitHub
Inference code for Llama models
☆59,521Jan 26, 2025Updated last year
amazon-science / mm-cot
View on GitHub
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
☆3,984Jun 12, 2024Updated 2 years ago
mlc-ai / mlc-llm
View on GitHub
Universal LLM Deployment Engine with ML Compilation
☆22,988Updated this week
nomic-ai / gpt4all
View on GitHub
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
☆77,394May 27, 2025Updated last year
karpathy / micrograd
View on GitHub
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
☆16,822Aug 8, 2024Updated last year
nebuly-ai / optimate
View on GitHub
A collection of libraries to optimise AI model performances
☆8,332Jul 22, 2024Updated 2 years ago
lucidrains / PaLM-rlhf-pytorch
View on GitHub
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
☆7,867May 29, 2026Updated last month
bigscience-workshop / petals
View on GitHub
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
☆10,399Sep 7, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
mistralai / mistral-inference
View on GitHub
Official inference library for Mistral models
☆10,830Jun 16, 2026Updated last month
openlm-research / open_llama
View on GitHub
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
☆7,531Jul 16, 2023Updated 3 years ago
guidance-ai / guidance
View on GitHub
A guidance language for controlling large language models.
☆21,691May 21, 2026Updated 2 months ago
huggingface / trl
View on GitHub
Train transformer language models with reinforcement learning.
☆18,913Updated this week
Stability-AI / StableLM
View on GitHub
StableLM: Stability AI Language Models
☆15,686Apr 8, 2024Updated 2 years ago
mlc-ai / web-llm
View on GitHub
High-performance In-browser LLM Inference Engine
☆18,441Jun 9, 2026Updated last month
nlpxucan / WizardLM
View on GitHub
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
☆9,480Jun 7, 2025Updated last year