google-research / kauldron
Modular, scalable library to train ML models
☆168 Updated this week
Alternatives and similar repositories for kauldron
Users who are interested in kauldron are comparing it to the libraries listed below.
- An implementation of PSGD Kron second-order optimizer for PyTorch ☆96 Updated 3 months ago
- a Jax quantization library ☆53 Updated this week
- ☆283 Updated last year
- Minimal yet performant LLM examples in pure JAX ☆187 Updated last month
- Dion optimizer algorithm ☆374 Updated last month
- ☆120 Updated 4 months ago
- A simple library for scaling up JAX programs ☆144 Updated last year
- ☆179 Updated 2 months ago
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more. ☆295 Updated last year
- 🧱 Modula software package ☆299 Updated 2 months ago
- ☆269 Updated last week
- ☆211 Updated last week
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 Updated last year
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism. ☆147 Updated 6 months ago
- Cost aware hyperparameter tuning algorithm ☆172 Updated last year
- ☆197 Updated 2 months ago
- Simple & Scalable Pretraining for Neural Architecture Research ☆298 Updated this week
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆103 Updated 10 months ago
- PyTorch implementation of models from the Zamba2 series. ☆185 Updated 9 months ago
- A simple, performant and scalable JAX-based world modeling codebase ☆77 Updated this week
- Implementation of Diffusion Transformer (DiT) in JAX ☆294 Updated last year
- For optimization algorithm research and development. ☆542 Updated this week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆147 Updated last month
- Accelerate, Optimize performance with streamlined training and serving options with JAX. ☆317 Updated this week
- A set of Python scripts that makes your experience on TPU better ☆54 Updated last month
- ☆103 Updated 3 months ago
- JAX-Toolbox ☆356 Updated this week
- ☆81 Updated last year
- Scalable and Performant Data Loading ☆330 Updated this week
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning. ☆116 Updated last week