qlabs-eng/slowrun

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/qlabs-eng/slowrun)

qlabs-eng / slowrun

100M tokens. Infinite compute. Lowest val loss wins.

☆514

Alternatives and similar repositories for slowrun

Users that are interested in slowrun are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KellerJordan / modded-nanogpt
View on GitHub
NanoGPT (124M) in 90 seconds
☆5,496Jul 3, 2026Updated 2 weeks ago
sanyalsunny111 / Looped-GPT
View on GitHub
Minimal and highly hackable implementation of Looped Transformers with GPT
☆25Mar 8, 2026Updated 4 months ago
openai / parameter-golf
View on GitHub
Train the smallest LM you can that fits in 16MB. Best model wins!
☆5,163May 4, 2026Updated 2 months ago
hiverge / cifar10-speedrun
View on GitHub
CIFAR-10 speedrun: Trains to 94% accuracy in 1.98 seconds on a single NVIDIA A100 GPU.
☆79Oct 17, 2025Updated 9 months ago
EleutherAI / nanoGPT-mup
View on GitHub
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆199Jan 19, 2026Updated 6 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
marin-community / marin
View on GitHub
Open-source framework for the research and development of foundation models.
☆1,207Updated this week
KellerJordan / Muon
View on GitHub
Muon is an optimizer for hidden layers in neural networks
☆2,705May 24, 2026Updated last month
Noumena-Network / nmoe
View on GitHub
MoE training for Me and You and maybe other people
☆394Mar 15, 2026Updated 4 months ago
anadim / subleq-transformer
View on GitHub
A transformer that executes a one-instruction Turing-complete computer — two approaches: hand-coded weights (no training) and learned fro…
☆41Mar 3, 2026Updated 4 months ago
microsoft / dion
View on GitHub
Dion optimizer algorithm
☆494Jul 12, 2026Updated last week
anpaure / cp_eval
View on GitHub
Tiny evaluation of leading LLMs on competitive programming problems
☆14Apr 10, 2026Updated 3 months ago
sjelassi / ebft_openrlhf
View on GitHub
Code for "Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models".
☆23Mar 16, 2026Updated 4 months ago
xjdr-alt / simple_transformer
View on GitHub
Simple Transformer in Jax
☆143Jun 22, 2024Updated 2 years ago
kvfrans / matrix-whitening
View on GitHub
Code for "What really matters in matrix-whitening optimizers?"
☆25Oct 31, 2025Updated 8 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
PrimeIntellect-ai / verifiers
View on GitHub
Our library for RL environments + evals
☆4,387Updated this week
OpenEvaByte / evabyte
View on GitHub
EvaByte: Efficient Byte-level Language Models at Scale
☆119Apr 22, 2025Updated last year
PrimeIntellect-ai / prime-rl
View on GitHub
Agentic RL Training at Scale
☆1,696Updated this week
aHapBean / NITP
View on GitHub
[ICML 2026] NITP: Next Implicit Token Prediction for LLM Pre-training
☆33May 26, 2026Updated last month
j4orz / ateenysitp
View on GitHub
a whirlwind tour to deep learning and deep learning systems
☆81Updated this week
facebookresearch / llm-speedrunner
View on GitHub
The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…
☆145May 6, 2026Updated 2 months ago
chanind / claude-auto-research-synthsaebench
View on GitHub
☆23Mar 11, 2026Updated 4 months ago
strangeloopcanon / tevo
View on GitHub
TEVO: evolve LM motifs cheaply, then validate them in downstream train.py loops.
☆19Apr 18, 2026Updated 3 months ago
vukrosic / muon-optimizer-guide
View on GitHub
Use Muon optimizer instead of AdamW.
☆48Mar 2, 2026Updated 4 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
fla-org / flash-linear-attention
View on GitHub
🚀 Efficient implementations for emerging model architectures
☆5,367Updated this week
AlpinDale / qwen_megakernel
View on GitHub
Aggressive decode optimizations for Qwen3-0.6B on RTX 5090
☆53Feb 25, 2026Updated 4 months ago
facebookresearch / PhysicsLM4
View on GitHub
Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality
☆356May 20, 2026Updated 2 months ago
cat-state / modded-nanogpt-moe
View on GitHub
☆17Sep 6, 2025Updated 10 months ago
Yifei-Zuo / Parallax
View on GitHub
Official repository for Parallax (Parameterized Local Linear Attention)
☆65Jul 7, 2026Updated last week
ServiceNow / PipelineRL
View on GitHub
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
☆427Updated this week
nikhilvyas / SOAP
View on GitHub
☆273Dec 2, 2024Updated last year
Dao-AILab / gram-newton-schulz
View on GitHub
Fast Polar Decomposition for Muon
☆165Jul 2, 2026Updated 2 weeks ago
modula-systems / modula
View on GitHub
🧱 Modula software package
☆337Aug 18, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
alexzhang13 / longcot-mini-rlm-results
View on GitHub
Storing the LongCoT-mini results for RLM(GPT-5.2)
☆20Apr 26, 2026Updated 2 months ago
ethansmith2000 / fsdp_optimizers
View on GitHub
supporting pytorch FSDP for optimizers
☆84Dec 8, 2024Updated last year
JoeLi12345 / nGPT
View on GitHub
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆112Mar 7, 2025Updated last year
MoonshotAI / Moonlight
View on GitHub
Muon is Scalable for LLM Training
☆1,504Aug 3, 2025Updated 11 months ago
valine / training-hot-swap
View on GitHub
Pytorch script hot swap: Change code without unloading your LLM from VRAM
☆126Apr 21, 2025Updated last year
google-deepmind / nanodo
View on GitHub
☆304Jul 15, 2024Updated 2 years ago
borawhocodess / modded-nanotabpfn
View on GitHub
speedrunning TFM pretraining
☆48Jun 28, 2026Updated 3 weeks ago