CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds
☆362Nov 15, 2025Updated 3 months ago
Alternatives and similar repositories for cifar10-airbench
Users that are interested in cifar10-airbench are comparing it to the libraries listed below
Sorting:
- NanoGPT (124M) in 2 minutes☆4,734Feb 27, 2026Updated last week
- Implementation of PSGD optimizer in JAX☆35Dec 31, 2024Updated last year
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Jul 24, 2025Updated 7 months ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- Muon is an optimizer for hidden layers in neural networks☆2,350Jan 19, 2026Updated last month
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆190Jan 11, 2026Updated last month
- ☆34Jan 25, 2024Updated 2 years ago
- 🧱 Modula software package☆324Aug 18, 2025Updated 6 months ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆98Jul 24, 2025Updated 7 months ago
- Efficient optimizers☆285Dec 20, 2025Updated 2 months ago
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)☆1,300Dec 18, 2024Updated last year
- ☆21Jan 23, 2024Updated 2 years ago
- Schedule-Free Optimization in PyTorch☆2,262May 21, 2025Updated 9 months ago
- ☆27May 3, 2024Updated last year
- LLM training in simple, raw C/CUDA☆15Dec 5, 2024Updated last year
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆28Feb 16, 2024Updated 2 years ago
- Synthetic Alphabet Dataset☆19Mar 27, 2025Updated 11 months ago
- ☆52Jun 10, 2024Updated last year
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆355Jul 29, 2024Updated last year
- Code for "What really matters in matrix-whitening optimizers?"☆22Oct 31, 2025Updated 4 months ago
- WIP