KellerJordan / cifar10-airbench
94% on CIFAR-10 in 2.6 seconds · 96% in 27 seconds
★177 · Updated last week
Related projects
Alternatives and complementary repositories for cifar10-airbench
- ★128 · Updated this week
- Scalable neural net training via automatic normalization in the modular norm. ★121 · Updated 3 months ago
- WIP ★89 · Updated 3 months ago
- Efficient optimizers ★79 · Updated this week
- seqax = sequence modeling + JAX ★133 · Updated 4 months ago
- Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead ★109 · Updated last week
- The AdEMAMix Optimizer: Better, Faster, Older. ★172 · Updated 2 months ago
- ★197 · Updated 4 months ago
- σ-GPT: A New Approach to Autoregressive Models ★59 · Updated 3 months ago
- Accelerated First Order Parallel Associative Scan ★163 · Updated 3 months ago
- Annotated version of the Mamba paper ★457 · Updated 8 months ago
- Simple implementation of muP, based on the Spectral Condition for Feature Learning. The implementation is SGD only; don't use it with Adam. ★69 · Updated 3 months ago
- For optimization algorithm research and development. ★449 · Updated this week
- ★73 · Updated 4 months ago
- Universal Tensor Operations in Einstein-Inspired Notation for Python. ★328 · Updated last month
- Normalized Transformer (nGPT) ★66 · Updated this week
- A library for unit scaling in PyTorch ★105 · Updated 2 weeks ago
- PyTorch implementation of the PEER block from the paper Mixture of A Million Experts, by Xu Owen He at DeepMind ★112 · Updated 2 months ago
- Experiment of using Tangent to autodiff triton ★72 · Updated 9 months ago
- Understand and test language model architectures on synthetic tasks. ★162 · Updated 6 months ago
- Minimal (400 LOC) implementation of Maximum (multi-node, FSDP) GPT training ★113 · Updated 7 months ago
- Just some miscellaneous utility functions / decorators / modules related to PyTorch and Accelerate to help speed up implementation of new… ★119 · Updated 3 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ★178 · Updated 5 months ago
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff" ★214 · Updated 3 months ago
- A repository for log-time feedforward networks ★216 · Updated 7 months ago
- FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores ★281 · Updated last month
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ★95 · Updated 2 weeks ago
- ★228 · Updated 2 months ago
- LoRA for arbitrary JAX models and functions ★132 · Updated 8 months ago
- ViT Prisma is a mechanistic interpretability library for Vision Transformers (ViTs). ★179 · Updated this week