100M tokens. Infinite compute. Lowest val loss wins.
☆398 · Apr 7, 2026 · Updated this week
Alternatives and similar repositories for slowrun
Users who are interested in slowrun are comparing it to the libraries listed below.
- Code for "What really matters in matrix-whitening optimizers?" ☆23 · Oct 31, 2025 · Updated 5 months ago
- llm201n: neural networks zero to super hero. the bridge from mirograd to tinygrad! ☆63 · Updated this week
- ☆26 · Feb 20, 2026 · Updated last month
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes. ☆22 · Feb 19, 2026 · Updated last month
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models" ☆11 · Apr 15, 2024 · Updated last year
- A power user focused interface for LLM base models. ☆67 · Mar 11, 2026 · Updated last month
- ☆31 · Nov 30, 2025 · Updated 4 months ago
- ☆10 · Oct 24, 2024 · Updated last year
- ☆56 · Mar 13, 2026 · Updated 3 weeks ago
- ☆36 · Feb 26, 2024 · Updated 2 years ago
- Benchmarking Optimizers for LLM Pretraining ☆57 · Dec 30, 2025 · Updated 3 months ago
- Add ability to interrupt own message ☆14 · Apr 21, 2024 · Updated last year
- Code for minimum-entropy coupling. ☆33 · Jan 6, 2026 · Updated 3 months ago
- ☆262 · Dec 2, 2024 · Updated last year
- Multi-Layer Sparse Autoencoders (ICLR 2025) ☆29 · Feb 6, 2026 · Updated 2 months ago
- Approve Claude Code permission requests from your phone via ntfy ☆54 · Mar 2, 2026 · Updated last month
- ☆52 · Mar 30, 2026 · Updated 2 weeks ago
- Automatically review Claude Code plans using external AI CLIs ☆55 · Mar 2, 2026 · Updated last month
- Rose (n-way) trees with both upwards- (i.e. cached) and downwards-traveling (i.e. accumulating) monoidal annotations. ☆16 · Mar 31, 2026 · Updated last week
- Timelight: Universal Path Generator ☆23 · Aug 24, 2025 · Updated 7 months ago
- Simple Transformer in Jax ☆143 · Jun 22, 2024 · Updated last year
- Haskell port of the Tensor Algebra COmpiler ☆16 · Nov 18, 2019 · Updated 6 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… ☆139 · Apr 3, 2026 · Updated last week
- ☆20 · Jan 27, 2024 · Updated 2 years ago
- ☆28 · Oct 7, 2025 · Updated 6 months ago
- ☆34 · Jul 5, 2023 · Updated 2 years ago
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc… ☆23 · Mar 4, 2024 · Updated 2 years ago
- backlinks graph for Craft Docs spaces ☆20 · Mar 1, 2026 · Updated last month
- Model REVOLVER, a human in the loop model mixing system. ☆33 · Aug 2, 2023 · Updated 2 years ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation ☆12 · May 24, 2021 · Updated 4 years ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023) ☆17 · Sep 3, 2024 · Updated last year
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆151 · Oct 2, 2025 · Updated 6 months ago
- supporting pytorch FSDP for optimizers ☆84 · Dec 8, 2024 · Updated last year
- NanoGPT (124M) in 2 minutes ☆5,070 · Mar 29, 2026 · Updated 2 weeks ago
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively. ☆71 · Updated this week
- Design and analyze optimal deep learning models. ☆31 · Aug 2, 2025 · Updated 8 months ago
- Computational abilities and efficiency of neural networks ☆57 · Jul 19, 2025 · Updated 8 months ago
- ☆59 · Jun 23, 2025 · Updated 9 months ago
- Official implementation of IJCAI 2024 paper "Cross-Domain Feature Augmentation for Domain Generalization" ☆18 · Feb 21, 2026 · Updated last month