evanatyourservice / psgd_jax

Implementation of PSGD optimizer in JAX

☆34

Alternatives and similar repositories for psgd_jax

Users who are interested in psgd_jax are comparing it to the libraries listed below.

young-geng / scalax
A simple library for scaling up JAX programs
☆140 Updated 9 months ago
davisyoshida / lorax
LoRA for arbitrary JAX models and functions
☆140 Updated last year
young-geng / mintext
Minimal but scalable implementation of large language models in JAX
☆35 Updated 2 weeks ago
JesseFarebro / flax-mup
Maximal Update Parametrization (μP) with Flax & Optax.
☆16 Updated last year
BirkhoffG / jax-dataloader
PyTorch-like dataloaders for JAX.
☆94 Updated 2 months ago
kvfrans / splus
☆115 Updated last month
jax-ml / jax-llm-examples
☆141 Updated this week
lindermanlab / elk
Scalable and Stable Parallelization of Nonlinear RNNs
☆17 Updated 6 months ago
dlwh / jax_sourceror
Turn jitted JAX functions back into Python source code
☆22 Updated 7 months ago
jenkspt / gpt-jax
JAX/Flax rewrite of Karpathy's nanoGPT
☆59 Updated 2 years ago
google-deepmind / nanodo
☆275 Updated last year
MatX-inc / seqax
seqax = sequence modeling + JAX
☆165 Updated 2 weeks ago
ethansmith2000 / fsdp_optimizers
Supporting PyTorch FSDP for optimizers
☆84 Updated 7 months ago
modula-systems / modula
🧱 Modula software package
☆216 Updated last week
nikhilvyas / SOAP
☆206 Updated 8 months ago
davisyoshida / qax
If it quacks like a tensor...
☆58 Updated 8 months ago
kvfrans / jax-flow
Flow-matching algorithms in JAX
☆100 Updated 11 months ago
radarFudan / mamba-minimal-jax
☆31 Updated 8 months ago
ayaka14732 / jax-smi
JAX Synergistic Memory Inspector
☆177 Updated last year
GallagherCommaJack / modulax
☆17 Updated 11 months ago
martin-marek / batch-size
📄 Small Batch Size Training for Language Models
☆36 Updated last week
google-research / jaxpruner
☆232 Updated 5 months ago
johnryan465 / pscan
☆40 Updated last year
samuela / torch2jax
Run PyTorch in JAX. 🤝
☆266 Updated 3 weeks ago
AllanYangZhou / universal_neural_functional
☆51 Updated last year
andyljones / boardlaw
Scaling scaling laws with board games.
☆51 Updated 2 years ago
cgarciae / nanoGPT-jax
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆35 Updated last year
AllanYangZhou / midGPT
Distributed pretraining of large language models (LLMs) on cloud TPU slices, with JAX and Equinox.
☆24 Updated 10 months ago
google-deepmind / jmp
JMP is a Mixed Precision library for JAX.
☆207 Updated 6 months ago
young-geng / tpu_pod_commander
TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.
☆20 Updated last year