google-research / kauldron
Modular, scalable library to train ML models
☆111Updated this week
Alternatives and similar repositories for kauldron
Users that are interested in kauldron are comparing it to the libraries listed below
Sorting:
- ☆109Updated this week
- ☆129Updated last month
- A simple library for scaling up JAX programs☆134Updated 6 months ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆32Updated 6 months ago
- ☆217Updated 10 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆120Updated last week
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆123Updated last year
- ☆81Updated last year
- ☆79Updated 10 months ago
- ☆94Updated 3 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆30Updated this week
- ☆205Updated this week
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆44Updated 11 months ago
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆24Updated 7 months ago
- ☆47Updated 6 months ago
- A set of Python scripts that makes your experience on TPU better☆53Updated 10 months ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆91Updated last month
- ☆53Updated last year
- 🧱 Modula software package☆189Updated last month
- Learn online intrinsic rewards from LLM feedback☆37Updated 4 months ago
- seqax = sequence modeling + JAX☆155Updated last month
- EvaByte: Efficient Byte-level Language Models at Scale☆97Updated 3 weeks ago
- Mobile Viewer for W&B, built on top of Flutter.☆34Updated last year
- some common Huggingface transformers in maximal update parametrization (µP)☆80Updated 3 years ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Updated 10 months ago
- Library for reading and processing ML training data.☆441Updated this week
- ☆43Updated last year
- PyTorch implementation of models from the Zamba2 series.☆181Updated 3 months ago
- Multi-backend recommender systems with Keras 3☆107Updated last week
- ☆222Updated 2 months ago