The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in language modeling.
☆140Apr 30, 2026Updated this week
Alternatives and similar repositories for llm-speedrunner
Users that are interested in llm-speedrunner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "What really matters in matrix-whitening optimizers?"☆23Oct 31, 2025Updated 6 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 8 months ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- ☆136Oct 16, 2025Updated 6 months ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Ongoing research training transformer language models at scale, including: BERT☆16Apr 25, 2019Updated 7 years ago
- Agentic RL Training at Scale☆1,338Updated this week
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated 2 years ago
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆25Jun 28, 2025Updated 10 months ago
- ☆34May 14, 2025Updated 11 months ago
- [NeurIPS 2025, Spotlight]: Ambient-o: Training Good models with Bad Data.☆34Apr 6, 2026Updated 3 weeks ago
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent☆16Sep 8, 2022Updated 3 years ago
- AIRS-Bench: an AI Research Science benchmark for quantifying the end-to-end AI research abilities of LLM agents☆84Apr 24, 2026Updated last week
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆32Oct 9, 2025Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Muon fsdp 2☆56Aug 8, 2025Updated 8 months ago
- ☆18Nov 11, 2025Updated 5 months ago
- ☆28Jan 17, 2025Updated last year
- Pytorch routines for (Ker)nel (Mac)hines