evanatyourservice / psgd_jaxLinks
Implementation of PSGD optimizer in JAX
☆34Updated 7 months ago
Alternatives and similar repositories for psgd_jax
Users that are interested in psgd_jax are comparing it to the libraries listed below
Sorting:
- A simple library for scaling up JAX programs☆140Updated 9 months ago
- LoRA for arbitrary JAX models and functions☆140Updated last year
- Minimal but scalable implementation of large language models in JAX☆35Updated 2 weeks ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Updated last year
- Pytorch-like dataloaders for JAX.☆94Updated 2 months ago
- ☆115Updated last month
- ☆141Updated this week
- Scalable and Stable Parallelization of Nonlinear RNNS☆17Updated 6 months ago
- Turn jitted jax functions back into python source code☆22Updated 7 months ago
- Jax/Flax rewrite of Karpathy's nanoGPT☆59Updated 2 years ago
- ☆275Updated last year
- seqax = sequence modeling + JAX☆165Updated 2 weeks ago
- supporting pytorch FSDP for optimizers☆84Updated 7 months ago
- 🧱 Modula software package☆216Updated last week
- ☆206Updated 8 months ago
- If it quacks like a tensor...☆58Updated 8 months ago
- Flow-matching algorithms in JAX☆100Updated 11 months ago
- ☆31Updated 8 months ago
- JAX Synergistic Memory Inspector☆177Updated last year
- ☆17Updated 11 months ago
- 📄Small Batch Size Training for Language Models☆36Updated last week
- ☆232Updated 5 months ago
- ☆40Updated last year
- Run PyTorch in JAX. 🤝☆266Updated 3 weeks ago
- ☆51Updated last year
- Scaling scaling laws with board games.☆51Updated 2 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆35Updated last year
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆24Updated 10 months ago
- JMP is a Mixed Precision library for JAX.☆207Updated 6 months ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆20Updated last year