☆18 · Aug 24, 2024 · Updated last year
Alternatives and similar repositories for modulax
Users that are interested in modulax are comparing it to the libraries listed below.
- A repo based on XiLin Li's PSGD repo that extends some of the experiments. ☆14 · Oct 7, 2024 · Updated last year
- Automatically take good care of your preemptible TPUs ☆37 · May 15, 2023 · Updated 3 years ago
- Maximal Update Parametrization (μP) with Flax & Optax. ☆16 · Dec 27, 2023 · Updated 2 years ago
- DiT (training + flow matching) in Jax ☆11 · Jan 5, 2025 · Updated last year
- PyTorch interface for TrueGrad Optimizers ☆43 · Aug 8, 2023 · Updated 2 years ago
- ☆19 · Dec 4, 2025 · Updated 5 months ago
- ☆12 · Apr 26, 2024 · Updated 2 years ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator ☆32 · Jul 28, 2023 · Updated 2 years ago
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models ☆16 · Sep 10, 2025 · Updated 8 months ago
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony ☆33 · Oct 16, 2023 · Updated 2 years ago
- Turn jitted jax functions back into python source code ☆23 · Dec 16, 2024 · Updated last year
- Using a modified version of Werner Duvaud's MuZero implementation (https://github.com/werner-duvaud/muzero-general) this reinforcement ag… ☆19 · Jun 30, 2021 · Updated 4 years ago
- ☆33 · Oct 4, 2024 · Updated last year
- ☆16 · Dec 30, 2024 · Updated last year
- A port of muP to JAX/Haiku ☆25 · Oct 23, 2022 · Updated 3 years ago
- Resilient Model-Based RL by Regularizing Posterior Predictability ☆22 · Mar 4, 2024 · Updated 2 years ago
- ☆24 · Jun 18, 2024 · Updated last year
- Implementation of PSGD optimizer in JAX ☆35 · Dec 31, 2024 · Updated last year
- nanoGPT using Equinox ☆15 · Mar 3, 2023 · Updated 3 years ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models) ☆34 · Feb 27, 2025 · Updated last year
- Schedule free optimiser implemented in JAX using Optimistix ☆15 · May 29, 2024 · Updated last year
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023 ☆30 · Jul 18, 2023 · Updated 2 years ago
- ☆33 · Nov 4, 2024 · Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆19 · Jul 24, 2025 · Updated 9 months ago
- Code for the paper "Function-Space Learning Rates" ☆24 · Jun 3, 2025 · Updated 11 months ago
- ☆39 · Jan 27, 2026 · Updated 3 months ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models ☆40 · Apr 14, 2026 · Updated last month
- [CVPR 2024] Friendly Sharpness-Aware Minimization ☆35 · Oct 29, 2024 · Updated last year
- Implementation of Diffusion Transformers and Rectified Flow in Jax ☆27 · Jul 9, 2024 · Updated last year
- BH hackathon ☆14 · Apr 4, 2024 · Updated 2 years ago
- Official implementation of 'A Large-Scale Exploration of mu-Transfer' ☆32 · Jun 5, 2025 · Updated 11 months ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py). ☆33 · Aug 14, 2022 · Updated 3 years ago
- An implementation of a Brownian motion using ClojureScript with re-frame and Highcharts ☆11 · Feb 8, 2019 · Updated 7 years ago
- ☆35 · Dec 5, 2022 · Updated 3 years ago
- AdamW optimizer for bfloat16 models in pytorch 🔥. ☆40 · Jun 16, 2024 · Updated last year
- DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation (NeurIPS 2024) ☆44 · Feb 18, 2025 · Updated last year
- A package for defining deep learning models using categorical algebraic expressions. ☆61 · Jul 27, 2024 · Updated last year
- Brax + Pufferlib + CARBS for gpu-accelerated robotics RL ☆12 · Jun 12, 2025 · Updated 11 months ago
- A PyTorch implementation of the Exclusive Cross Entropy Loss. ☆20 · Aug 12, 2022 · Updated 3 years ago