openai/parameter-golf

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/openai/parameter-golf)

openai / parameter-golf

Train the smallest LM you can that fits in 16MB. Best model wins!

☆5,176

Alternatives and similar repositories for parameter-golf

Users that are interested in parameter-golf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KellerJordan / modded-nanogpt
View on GitHub
NanoGPT (124M) in 90 seconds
☆5,600Updated this week
qlabs-eng / slowrun
View on GitHub
100M tokens. Infinite compute. Lowest val loss wins.
☆518Jul 3, 2026Updated 3 weeks ago
karpathy / nanochat
View on GitHub
The best ChatGPT that $100 can buy.
☆56,732Jul 4, 2026Updated 3 weeks ago
karpathy / autoresearch
View on GitHub
AI agents running research on single-GPU nanochat training automatically
☆92,224Mar 26, 2026Updated 4 months ago
anthropics / original_performance_takehome
View on GitHub
Anthropic's original performance take-home, now open for you to try!
☆4,076Jan 22, 2026Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
huggingface / ml-intern
View on GitHub
🤗 ml-intern: an open-source ML engineer that reads papers, trains models, and ships ML models
☆10,698Updated this week
THUDM / slime
View on GitHub
slime is an LLM post-training framework for RL Scaling.
☆7,679Updated this week
PrimeIntellect-ai / verifiers
View on GitHub
Our library for RL environments + evals
☆4,410Updated this week
fla-org / flash-linear-attention
View on GitHub
🚀 Efficient implementations for emerging model architectures
☆5,463Updated this week
PrimeIntellect-ai / prime-rl
View on GitHub
Agentic RL Training at Scale
☆1,759Updated this week
unslothai / unsloth
View on GitHub
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆68,965Updated this week
vllm-project / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆87,317Updated this week
thinking-machines-lab / tinker-cookbook
View on GitHub
Post-training with Tinker
☆3,939Updated this week
openai / codex
View on GitHub
Lightweight coding agent that runs in your terminal
☆102,101Updated this week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
karpathy / nanoGPT
View on GitHub
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆61,632Nov 12, 2025Updated 8 months ago
openai / gpt-oss
View on GitHub
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
☆20,266Updated this week
openai / harmony
View on GitHub
Renderer for the harmony response format to be used with gpt-oss
☆4,468Apr 8, 2026Updated 3 months ago
sgl-project / sglang
View on GitHub
SGLang is a high-performance serving framework for large language models and multimodal models.
☆30,854Updated this week
openai / symphony
View on GitHub
Symphony turns project work into isolated, autonomous implementation runs, allowing teams to manage work instead of supervising coding ag…
☆26,280Updated this week
KellerJordan / Muon
View on GitHub
Muon is an optimizer for hidden layers in neural networks
☆2,747May 24, 2026Updated 2 months ago
PufferAI / PufferLib
View on GitHub
Puffing up reinforcement learning
☆6,198Updated this week
verl-project / verl
View on GitHub
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,699Updated this week
marin-community / marin
View on GitHub
Open-source framework for the research and development of foundation models.
☆1,226Updated this week
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
HazyResearch / ThunderKittens
View on GitHub
Tile primitives for speedy kernels
☆3,566Jul 13, 2026Updated 2 weeks ago
MoonshotAI / Attention-Residuals
View on GitHub
☆3,417Mar 17, 2026Updated 4 months ago
GeeeekExplorer / nano-vllm
View on GitHub
Nano vLLM
☆14,679Apr 26, 2026Updated 3 months ago
deepseek-ai / DeepSpec
View on GitHub
DeepSpec: a full-stack codebase for training and evaluating speculative decoding algorithms
☆6,803Jul 9, 2026Updated 2 weeks ago
NovaSky-AI / SkyRL
View on GitHub
SkyRL: A Modular Full-stack RL Library for LLMs
☆2,102Updated this week
aisa-group / PostTrainBench
View on GitHub
Measuring how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours
☆472Jul 22, 2026Updated last week
stanfordnlp / dspy
View on GitHub
DSPy: The framework for programming—not prompting—language models
☆36,434Updated this week
sapientinc / HRM
View on GitHub
Hierarchical Reasoning Model Official Release
☆12,599Mar 31, 2026Updated 3 months ago
RightNow-AI / autokernel
View on GitHub
Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.
☆1,483Mar 19, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
deepseek-ai / Engram
View on GitHub
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
☆4,568Jan 14, 2026Updated 6 months ago
Dao-AILab / quack
View on GitHub
A Quirky Assortment of CuTe Kernels
☆1,076Updated this week
tanishqkumar / ssd
View on GitHub
A lightweight inference engine supporting speculative speculative decoding (SSD).
☆975May 10, 2026Updated 2 months ago
huggingface / nanotron
View on GitHub
Minimalistic large language model 3D-parallelism training
☆2,768May 26, 2026Updated 2 months ago
deepseek-ai / TileKernels
View on GitHub
A kernel library written in tilelang
☆1,677Apr 23, 2026Updated 3 months ago
huggingface / picotron
View on GitHub
Minimalistic 4D-parallelism distributed training framework for education purpose
☆2,260Aug 26, 2025Updated 11 months ago
alexzhang13 / rlm
View on GitHub
General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.
☆5,326Jun 26, 2026Updated last month