tysam-code/hlb-CIFAR10

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tysam-code/hlb-CIFAR10)

tysam-code / hlb-CIFAR10

Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)

☆1,305

Alternatives and similar repositories for hlb-CIFAR10

Users that are interested in hlb-CIFAR10 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tysam-code / hlb-gpt
View on GitHub
Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…
☆357Jul 29, 2024Updated last year
JonasGeiping / cramming
View on GitHub
Cramming the training of a (BERT-type) language model into limited compute.
☆1,366Jun 13, 2024Updated 2 years ago
libffcv / ffcv
View on GitHub
FFCV: Fast Forward Computer Vision (and other ML workloads!)
☆2,989Jun 16, 2024Updated 2 years ago
KellerJordan / cifar10-airbench
View on GitHub
CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds
☆384Nov 15, 2025Updated 8 months ago
Lightning-AI / lit-llama
View on GitHub
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…
☆6,082Jul 1, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
facebookresearch / schedule_free
View on GitHub
Schedule-Free Optimization in PyTorch
☆2,314Jun 18, 2026Updated last month
mosaicml / composer
View on GitHub
Supercharge Your Model Training
☆5,487Apr 29, 2026Updated 2 months ago
arogozhnikov / einops
View on GitHub
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
☆9,553Jul 5, 2026Updated 2 weeks ago
google-research / tuning_playbook
View on GitHub
A playbook for systematically maximizing the performance of deep learning models.
☆30,249Jun 18, 2024Updated 2 years ago
meta-pytorch / gpt-fast
View on GitHub
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
☆6,229Aug 22, 2025Updated 10 months ago
facebookresearch / xformers
View on GitHub
Hackable and optimized Transformers building blocks, supporting a composable construction.
☆10,524Updated this week
BlinkDL / RWKV-LM
View on GitHub
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…
☆14,629Updated this week
borisdayma / vit-vqgan
View on GitHub
JAX implementation ViT-VQGAN
☆66Jul 23, 2022Updated 3 years ago
tinygrad / tinygrad
View on GitHub
You like pytorch? You like micrograd? You love tinygrad! ❤️
☆33,306Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
PiotrNawrot / nanoT5
View on GitHub
Fast & Simple repository for pre-training and fine-tuning T5-style models
☆1,021Aug 21, 2024Updated last year
bitsandbytes-foundation / bitsandbytes
View on GitHub
Accessible large language models via k-bit quantization for PyTorch.
☆8,333Updated this week
karpathy / minGPT
View on GitHub
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
☆24,721Aug 15, 2024Updated last year
karpathy / nanoGPT
View on GitHub
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆61,351Nov 12, 2025Updated 8 months ago
huggingface / pytorch-image-models
View on GitHub
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆36,993Updated this week
lucidrains / x-transformers
View on GitHub
A concise but complete full-attention transformer with a set of promising experimental features from various papers
☆5,922Updated this week
FMInference / FlexLLMGen
View on GitHub
Running large language models on a single GPU for throughput-oriented scenarios.
☆9,359Oct 28, 2024Updated last year
karpathy / llama2.c
View on GitHub
Inference Llama 2 in one file of pure C
☆19,745Aug 6, 2024Updated last year
mlfoundations / open_clip
View on GitHub
An open source implementation of CLIP.
☆14,006Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
microsoft / VQ-Diffusion
View on GitHub
Official implementation of VQ-Diffusion
☆981Apr 17, 2024Updated 2 years ago
google-deepmind / mctx
View on GitHub
Monte Carlo tree search in JAX
☆2,643Jul 9, 2026Updated last week
Dao-AILab / flash-attention
View on GitHub
Fast and memory-efficient exact attention
☆24,497Updated this week
srush / Tensor-Puzzles
View on GitHub
Solve puzzles. Improve your pytorch.
☆4,237Jul 15, 2024Updated 2 years ago
lucidrains / lion-pytorch
View on GitHub
🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch
☆2,195Jul 9, 2026Updated last week
triton-lang / triton
View on GitHub
Development repository for the Triton language and compiler
☆19,738Updated this week
google-research / big_vision
View on GitHub
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
☆3,494May 19, 2025Updated last year
KellerJordan / modded-nanogpt
View on GitHub
NanoGPT (124M) in 90 seconds
☆5,518Jul 3, 2026Updated 2 weeks ago
google-deepmind / penzai
View on GitHub
A JAX research toolkit for building, editing, and visualizing neural networks.
☆1,891Jun 22, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
google / flax
View on GitHub
Flax is a neural network library for JAX that is designed for flexibility.
☆7,272Jul 7, 2026Updated 2 weeks ago
LAION-AI / Open-Assistant
View on GitHub
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…
☆37,380Aug 17, 2024Updated last year
srush / GPU-Puzzles
View on GitHub
Solve puzzles. Learn CUDA.
☆12,332Sep 1, 2024Updated last year
srush / LLM-Training-Puzzles
View on GitHub
What would you do with 1000 H100s...
☆1,181Jan 10, 2024Updated 2 years ago
lucidrains / PaLM-rlhf-pytorch
View on GitHub
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
☆7,866May 29, 2026Updated last month
apple / ml-sigma-reparam
View on GitHub
☆315Jun 21, 2024Updated 2 years ago
facebookincubator / AITemplate
View on GitHub
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…
☆4,725Jul 14, 2026Updated last week