google-deepmind / nanodoLinks

☆285

Alternatives and similar repositories for nanodo

Users that are interested in nanodo are comparing it to the libraries listed below

Sorting:

modula-systems / modula
🧱 Modula software package
☆307Updated 3 months ago
MatX-inc / seqax
seqax = sequence modeling + JAX
☆168Updated 4 months ago
marin-community / levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆685Updated last week
jax-ml / jax-llm-examples
Minimal yet performant LLM examples in pure JAX
☆204Updated 2 months ago
young-geng / scalax
A simple library for scaling up JAX programs
☆144Updated last month
HomebrewML / HeavyBall
Efficient optimizers
☆276Updated 3 weeks ago
ayaka14732 / jax-smi
JAX Synergistic Memory Inspector
☆182Updated last year
imbue-ai / carbs
Cost aware hyperparameter tuning algorithm
☆175Updated last year
young-geng / mintext
Minimal but scalable implementation of large language models in JAX
☆35Updated this week
davisyoshida / lorax
LoRA for arbitrary JAX models and functions
☆143Updated last year
marin-community / haliax
Named Tensors for Legible Deep Learning in JAX
☆212Updated 3 weeks ago
google-research / jaxpruner
☆234Updated 9 months ago
kvfrans / jax-diffusion-transformer
Implementation of Diffusion Transformer (DiT) in JAX
☆297Updated last year
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆174Updated 5 months ago
google / grain
Library for reading and processing ML training data.
☆611Updated this week
evanatyourservice / psgd_jax
Implementation of PSGD optimizer in JAX
☆35Updated 11 months ago
erfanzar / EasyDeL
Accelerate, Optimize performance with streamlined training and serving options with JAX.
☆325Updated this week
KellerJordan / cifar10-airbench
CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds
☆327Updated 2 weeks ago
microsoft / dion
Dion optimizer algorithm
☆395Updated 2 weeks ago
athms / mad-lab
A MAD laboratory to improve AI architecture designs 🧪
☆135Updated 11 months ago
jenkspt / gpt-jax
Jax/Flax rewrite of Karpathy's nanoGPT
☆62Updated 2 years ago
AllanYangZhou / midGPT
Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.
☆24Updated last year
nikhilvyas / SOAP
☆224Updated last year
mlcommons / algorithmic-efficiency
MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…
☆401Updated this week
srush / Autodiff-Puzzles
☆460Updated last year
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆240Updated 2 months ago
ayaka14732 / llama-2-jax
JAX implementation of the Llama 2 model
☆216Updated last year
ethansmith2000 / fsdp_optimizers
supporting pytorch FSDP for optimizers
☆84Updated 11 months ago
cloneofsimo / min-fsdp
☆91Updated last year
rwitten / HighPerfLLMs2024
☆546Updated last year