google-research / kauldronLinks
Modular, scalable library to train ML models
☆186Updated this week
Alternatives and similar repositories for kauldron
Users that are interested in kauldron are comparing it to the libraries listed below
Sorting:
- a Jax quantization library☆83Updated this week
- An implementation of PSGD Kron second-order optimizer for PyTorch☆97Updated 5 months ago
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆297Updated last year
- ☆294Updated this week
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.☆155Updated 2 months ago
- ☆191Updated 3 weeks ago
- ☆287Updated last year
- For optimization algorithm research and development.☆556Updated 3 weeks ago
- Minimal yet performant LLM examples in pure JAX☆225Updated last week
- ☆214Updated this week
- Scalable and Performant Data Loading☆360Updated last week
- Implementation of Diffusion Transformer (DiT) in JAX☆300Updated last year
- ☆314Updated last year
- 🧱 Modula software package☆322Updated 4 months ago
- Cost aware hyperparameter tuning algorithm☆177Updated last year
- Dion optimizer algorithm☆413Updated this week
- ☆118Updated last month
- Attention Kernels for Symmetric Power Transformers☆128Updated 3 months ago
- RLP: Reinforcement as a Pretraining Objective☆222Updated 3 months ago
- Library for reading and processing ML training data.☆643Updated last week
- JAX-Toolbox☆373Updated this week
- ☆150Updated 4 months ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆305Updated 3 weeks ago
- ☆158Updated 2 months ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆328Updated last week
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆121Updated last month
- seqax = sequence modeling + JAX☆169Updated 5 months ago
- ☆82Updated last year
- Google TPU optimizations for transformers models☆133Updated 3 weeks ago