google-research / kauldronLinks
Modular, scalable library to train ML models
☆204Updated this week
Alternatives and similar repositories for kauldron
Users that are interested in kauldron are comparing it to the libraries listed below
Sorting:
- An implementation of PSGD Kron second-order optimizer for PyTorch☆98Updated 6 months ago
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆300Updated last year
- a Jax quantization library☆90Updated this week
- For optimization algorithm research and development.☆558Updated 3 weeks ago
- ☆192Updated last week
- ☆214Updated 2 weeks ago
- 🧱 Modula software package☆322Updated 5 months ago
- ☆307Updated this week
- Dion optimizer algorithm☆431Updated 3 weeks ago
- Minimal yet performant LLM examples in pure JAX☆240Updated 3 weeks ago
- ☆291Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX☆306Updated last year
- ☆316Updated last year
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆130Updated 2 months ago
- Cost aware hyperparameter tuning algorithm☆179Updated last year
- Getting crystal-like representations with harmonic loss☆195Updated 10 months ago
- ☆153Updated 5 months ago
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.☆158Updated 2 months ago
- ☆82Updated last year
- PyTorch implementation of models from the Zamba2 series.☆186Updated last year
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆40Updated 2 months ago
- Scalable and Performant Data Loading☆364Updated last week
- σ-GPT: A New Approach to Autoregressive Models☆70Updated last year
- JAX-Toolbox☆382Updated this week
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆336Updated last week
- Large multi-modal models (L3M) pre-training.☆230Updated 4 months ago
- ☆166Updated 3 months ago
- Jax Codebase for Evolutionary Strategies at the Hyperscale☆218Updated last month
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆131Updated this week
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.☆115Updated last month