google-research / kauldronLinks
Modular, scalable library to train ML models
☆155Updated last week
Alternatives and similar repositories for kauldron
Users that are interested in kauldron are comparing it to the libraries listed below
Sorting:
- Dion optimizer algorithm☆318Updated last week
- A JAX-native LLM Post-Training Library☆123Updated this week
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆290Updated last year
- ☆251Updated this week
- Scalable and Performant Data Loading☆291Updated last week
- An implementation of PSGD Kron second-order optimizer for PyTorch☆96Updated last month
- ☆275Updated last year
- Library for reading and processing ML training data.☆519Updated this week
- ☆307Updated last year
- ☆188Updated last month
- For optimization algorithm research and development.☆530Updated this week
- Minimal yet performant LLM examples in pure JAX☆150Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆153Updated 2 months ago
- A simple library for scaling up JAX programs☆143Updated 9 months ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆304Updated this week
- ☆82Updated last year
- PyTorch implementation of models from the Zamba2 series.☆184Updated 7 months ago
- A set of Python scripts that makes your experience on TPU better☆54Updated last year
- JAX-Toolbox☆331Updated this week
- ☆139Updated last week
- Simple & Scalable Pretraining for Neural Architecture Research☆289Updated last week
- 🧱 Modula software package☆225Updated last week
- Automatically take good care of your preemptible TPUs☆36Updated 2 years ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆131Updated last year
- ☆115Updated last week
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆101Updated 8 months ago
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆101Updated 7 months ago
- PyTorch Single Controller☆368Updated this week
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆35Updated 2 weeks ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Updated 2 months ago