google-research / kauldronLinks
Modular, scalable library to train ML models
☆121Updated this week
Alternatives and similar repositories for kauldron
Users that are interested in kauldron are comparing it to the libraries listed below
Sorting:
- ☆131Updated 2 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆128Updated 3 weeks ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆91Updated 2 months ago
- 🧱 Modula software package☆194Updated 2 months ago
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆92Updated this week
- ☆269Updated 10 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆101Updated last month
- ☆118Updated 2 weeks ago
- Google TPU optimizations for transformers models☆112Updated 4 months ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆33Updated 7 months ago
- ☆182Updated this week
- A simple library for scaling up JAX programs☆137Updated 7 months ago
- ☆179Updated this week
- ☆47Updated 7 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆127Updated last year
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆139Updated 2 weeks ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆100Updated last month
- ☆51Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆100Updated 3 months ago
- ☆78Updated 11 months ago
- Scalable and Performant Data Loading☆269Updated last week
- Source code for the collaborative reasoner research project at Meta FAIR.☆87Updated last month
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆44Updated last year
- A set of Python scripts that makes your experience on TPU better☆54Updated 11 months ago
- DeMo: Decoupled Momentum Optimization☆188Updated 6 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆60Updated last week
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆287Updated 9 months ago
- ☆303Updated 11 months ago
- ☆114Updated this week
- NanoGPT (124M) quality in 2.67B tokens☆28Updated last month