☆18Aug 24, 2024Updated last year
Alternatives and similar repositories for modulax
Users that are interested in modulax are comparing it to the libraries listed below
Sorting:
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Oct 7, 2024Updated last year
- A port of muP to JAX/Haiku☆25Oct 23, 2022Updated 3 years ago
- ☆12Apr 26, 2024Updated last year
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- Implementation of various equivariant models in JAX☆12Apr 12, 2024Updated last year
- Tensorflow implementation of MuZero algorithm☆11Aug 23, 2022Updated 3 years ago
- DiT (training + flow matching) in Jax☆11Jan 5, 2025Updated last year
- nanoGPT using Equinox☆15Mar 3, 2023Updated 2 years ago
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models☆15Sep 10, 2025Updated 5 months ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Dec 27, 2023Updated 2 years ago
- Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"☆14Sep 20, 2024Updated last year
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆33Oct 16, 2023Updated 2 years ago
- Schedule free optimiser implemented in JAX using Optimistix☆15May 29, 2024Updated last year
- ☆16Dec 30, 2024Updated last year
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 2 years ago
- PyTorch interface for TrueGrad Optimizers☆43Aug 8, 2023Updated 2 years ago
- Using a modified version of Werner Duvaud's MuZero implementation (https://github.com/werner-duvaud/muzero-general) this reinforcement ag…☆18Jun 30, 2021Updated 4 years ago
- ☆19Dec 4, 2025Updated 2 months ago
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 8 months ago
- Resilient Model-Based RL by Regularizing Posterior Predictability☆22Mar 4, 2024Updated last year
- ☆23Jun 18, 2024Updated last year
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆32Jul 28, 2023Updated 2 years ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Feb 27, 2025Updated last year
- ☆33Oct 4, 2024Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Jun 5, 2025Updated 8 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆61Jul 27, 2024Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- [ICLR 2025] SDTT: a simple and effective distillation method for discrete diffusion models☆47Updated this week
- ☆33Jan 14, 2021Updated 5 years ago
- ☆28Nov 18, 2022Updated 3 years ago
- ☆31Jan 23, 2026Updated last month
- Focused on fast experimentation and simplicity☆80Dec 24, 2024Updated last year
- Implementation of PSGD optimizer in JAX☆35Dec 31, 2024Updated last year
- ☆33Nov 4, 2024Updated last year
- [CVPR 2024] Friendly Sharpness-Aware Minimization☆36Oct 29, 2024Updated last year
- AdamW optimizer for bfloat16 models in pytorch 🔥.☆39Jun 16, 2024Updated last year
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆40Feb 9, 2026Updated 3 weeks ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year