google-research / kauldronLinks

Modular, scalable library to train ML models

☆128

Alternatives and similar repositories for kauldron

Users that are interested in kauldron are comparing it to the libraries listed below

Sorting:

google-deepmind / nanodo
☆270Updated 11 months ago
young-geng / scalax
A simple library for scaling up JAX programs
☆139Updated 7 months ago
jax-ml / jax-llm-examples
☆126Updated last month
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆139Updated this week
evanatyourservice / kron_torch
An implementation of PSGD Kron second-order optimizer for PyTorch
☆91Updated 2 months ago
google / grain
Library for reading and processing ML training data.
☆463Updated this week
CLAIRE-Labo / EvoTune
Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.
☆103Updated 2 months ago
apple / ml-planner
☆53Updated last year
kvfrans / splus
☆104Updated 2 weeks ago
kvfrans / jax-diffusion-transformer
Implementation of Diffusion Transformer (DiT) in JAX
☆278Updated last year
google-deepmind / mishax
☆134Updated 2 months ago
ml-gde / jflux
JAX Implementation of Black Forest Labs' Flux.1 family of models
☆34Updated 8 months ago
iliao2345 / CompressARC
☆157Updated 2 months ago
modula-systems / modula
🧱 Modula software package
☆200Updated 3 months ago
fal-ai / diffusion-speedrun
Focused on fast experimentation and simplicity
☆75Updated 6 months ago
stanford-crfm / haliax
Named Tensors for Legible Deep Learning in JAX
☆181Updated last week
apoorvkh / academic-pretraining
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
☆140Updated last month
google-deepmind / tf2jax
☆114Updated this week
idiap / sigma-gpt
σ-GPT: A New Approach to Autoregressive Models
☆65Updated 10 months ago
cloneofsimo / scaling-guide
WIP
☆93Updated 10 months ago
cloneofsimo / min-max-gpt
Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training
☆129Updated last year
ShadeAlsha / ICon
ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"
☆96Updated this week
LucasPrietoAl / grokking-at-the-edge-of-numerical-stability
☆98Updated 5 months ago
samuela / torch2jax
Run PyTorch in JAX. 🤝
☆253Updated 4 months ago
yixiaoer / tpux
A set of Python scripts that makes your experience on TPU better
☆55Updated 11 months ago
epfml / DenseFormer
☆81Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆101Updated 3 months ago
cloneofsimo / min-fsdp
☆78Updated 11 months ago
MatX-inc / seqax
seqax = sequence modeling + JAX
☆162Updated 2 weeks ago
jax-ml / jax-ai-stack
☆193Updated this week