google-research / kauldron
Modular, scalable library to train ML models
☆178 Updated last week
Alternatives and similar repositories for kauldron
Users who are interested in kauldron are comparing it to the libraries listed below.
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more. ☆297 Updated last year
- An implementation of PSGD Kron second-order optimizer for PyTorch ☆97 Updated 4 months ago
- a Jax quantization library ☆74 Updated this week
- ☆288 Updated this week
- ☆285 Updated last year
- A simple library for scaling up JAX programs ☆144 Updated last month
- JAX Implementation of Black Forest Labs' Flux.1 family of models ☆39 Updated 3 weeks ago
- 🧱 Modula software package ☆315 Updated 3 months ago
- Minimal yet performant LLM examples in pure JAX ☆207 Updated last week
- ☆118 Updated this week
- ☆121 Updated 6 months ago
- Scalable and Performant Data Loading ☆352 Updated this week
- ☆144 Updated 3 months ago
- JAX-Toolbox ☆367 Updated this week
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆92 Updated last year
- ☆82 Updated last year
- Dion optimizer algorithm ☆403 Updated this week
- Named Tensors for Legible Deep Learning in JAX ☆212 Updated last month
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆103 Updated 11 months ago
- Cost aware hyperparameter tuning algorithm ☆176 Updated last year
- Accelerate, Optimize performance with streamlined training and serving options with JAX. ☆325 Updated this week
- ☆213 Updated last week
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 Updated last year
- Jax Codebase for Evolutionary Strategies at the Hyperscale ☆181 Updated 3 weeks ago
- RLP: Reinforcement as a Pretraining Objective ☆210 Updated 2 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods. ☆32 Updated 6 months ago
- Getting crystal-like representations with harmonic loss ☆192 Updated 8 months ago
- For optimization algorithm research and development. ☆548 Updated 3 weeks ago
- Implementation of Diffusion Transformer (DiT) in JAX ☆298 Updated last year
- Library for reading and processing ML training data. ☆621 Updated this week