KellerJordan / hlb-CIFAR10
Train to 94% on CIFAR-10 in 4.4 seconds on a single A100
☆12 · Updated 10 months ago
Related projects
Alternatives and complementary repositories for hlb-CIFAR10
- Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead (☆121, updated this week)
- Simple Transformer in Jax (☆119, updated 5 months ago)
- Experiment of using Tangent to autodiff triton (☆72, updated 10 months ago)
- A set of Python scripts that make your experience on TPU better (☆40, updated 4 months ago)
- seqax = sequence modeling + JAX (☆134, updated 4 months ago)
- 94% on CIFAR-10 in 2.6 seconds 💨 96% in 27 seconds (☆178, updated 2 weeks ago)
- Solve puzzles. Learn CUDA. (☆61, updated 11 months ago)
- Scalable neural net training via automatic normalization in the modular norm (☆122, updated this week)
- WIP (☆89, updated 3 months ago)
- The simplest, fastest repository for training/finetuning medium-sized GPTs (☆84, updated this week)
- A really tiny autograd engine (☆87, updated 7 months ago)
- Normalized Transformer (nGPT) (☆94, updated this week)
- Accelerated First Order Parallel Associative Scan (☆164, updated 3 months ago)
- Simplex Random Feature attention, in PyTorch (☆71, updated last year)
- Efficient optimizers (☆87, updated this week)
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)* (☆80, updated 11 months ago)
- train with kittens! (☆49, updated 3 weeks ago)
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al. (NeurIPS 2024) (☆180, updated 5 months ago)
- Minimal (400 LOC) implementation, maximum (multi-node, FSDP) GPT training (☆113, updated 7 months ago)
- smolLM with Entropix sampler on pytorch (☆141, updated 3 weeks ago)
- Experiments for efforts to train a new and improved t5 (☆76, updated 7 months ago)