google-research / kauldronLinks
Modular, scalable library to train ML models
☆182Updated 2 weeks ago
Alternatives and similar repositories for kauldron
Users that are interested in kauldron are comparing it to the libraries listed below
Sorting:
- a Jax quantization library☆80Updated last week
- ☆294Updated this week
- An implementation of PSGD Kron second-order optimizer for PyTorch☆97Updated 5 months ago
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆297Updated last year
- Dion optimizer algorithm☆413Updated this week
- ☆158Updated 2 months ago
- ☆287Updated last year
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆39Updated last month
- Scalable and Performant Data Loading☆360Updated last week
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.☆155Updated last month
- For optimization algorithm research and development.☆556Updated 2 weeks ago
- Jax Codebase for Evolutionary Strategies at the Hyperscale☆207Updated 2 weeks ago
- Simple & Scalable Pretraining for Neural Architecture Research☆306Updated last month
- Minimal yet performant LLM examples in pure JAX☆225Updated this week
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆328Updated this week
- ☆107Updated 5 months ago
- 🧱 Modula software package☆322Updated 4 months ago
- Google TPU optimizations for transformers models☆133Updated 3 weeks ago
- ☆191Updated 3 weeks ago
- MoE training for Me and You and maybe other people☆315Updated this week
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- JAX-Toolbox☆373Updated this week
- 👷 Build compute kernels☆198Updated 2 weeks ago
- σ-GPT: A New Approach to Autoregressive Models☆70Updated last year
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆105Updated last year
- A set of Python scripts that makes your experience on TPU better☆55Updated 3 months ago
- torchax is a PyTorch frontend for JAX. It gives JAX the ability to author JAX programs using familiar PyTorch syntax. It also provides JA…☆159Updated 2 weeks ago
- DeMo: Decoupled Momentum Optimization☆198Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆109Updated 10 months ago
- 📄Small Batch Size Training for Language Models☆77Updated 3 months ago