google-research / kauldron
Modular, scalable library to train ML models
☆166 Updated this week
Alternatives and similar repositories for kauldron
Users who are interested in kauldron compare it to the libraries listed below.
- A JAX-based library for building transformers, including implementations of GPT, Gemma, Llama, Mixtral, Whisper, Swin, ViT and more. ☆293 Updated last year
- An implementation of PSGD Kron second-order optimizer for PyTorch ☆95 Updated 2 months ago
- RLP: Reinforcement as a Pretraining Objective ☆155 Updated last week
- Dion optimizer algorithm ☆361 Updated last week
- Scalable and Performant Data Loading ☆304 Updated this week
- 🧱 Modula software package ☆282 Updated last month
- ☆282 Updated last year
- ☆142 Updated last month
- ☆211 Updated last week
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning. ☆112 Updated 2 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 Updated last year
- ☆188 Updated last month
- A JAX quantization library ☆49 Updated this week
- Minimal yet performant LLM examples in pure JAX ☆181 Updated 2 weeks ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆147 Updated last week
- PyTorch implementation of models from the Zamba2 series. ☆185 Updated 8 months ago
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism. ☆144 Updated 6 months ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r… ☆268 Updated this week
- For optimization algorithm research and development. ☆539 Updated this week
- ☆81 Updated last year
- ☆264 Updated this week
- JAX Implementation of Black Forest Labs' Flux.1 family of models ☆38 Updated last month
- ☆189 Updated 2 weeks ago
- DeMo: Decoupled Momentum Optimization ☆192 Updated 10 months ago
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.… ☆103 Updated 9 months ago
- Simple & Scalable Pretraining for Neural Architecture Research ☆296 Updated last month
- Cost aware hyperparameter tuning algorithm ☆171 Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆164 Updated 3 months ago
- Implementation of Diffusion Transformer (DiT) in JAX ☆292 Updated last year
- Gradient Boosting Reinforcement Learning (GBRL) ☆120 Updated last month