A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.
☆124Dec 29, 2025Updated 5 months ago
Alternatives and similar repositories for JAXformer
Users that are interested in JAXformer are comparing it to the libraries listed below.
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 8 months ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 4 months ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- ☆52Mar 14, 2025Updated last year
- A collection of open-source large language model (LLM) implementations in JAX & Flax☆24Apr 1, 2025Updated last year
- ☆14Dec 11, 2018Updated 7 years ago
- ☆79Feb 18, 2026Updated 3 months ago
- ☆15Mar 20, 2025Updated last year
- llms can learn their own context compression via RL☆43Nov 26, 2025Updated 6 months ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆25Nov 13, 2025Updated 6 months ago
- Generic library for neural collapse and several derivative works on the phenomenon.☆18Apr 14, 2025Updated last year
- Website for CSE 234, Winter 2025☆15Mar 24, 2025Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆203Jun 1, 2025Updated 11 months ago
- Frechet inception distance (FID) evaluation in JAX☆14May 28, 2024Updated 2 years ago
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- Tensor Parallelism with JAX + Shard Map☆11Sep 29, 2023Updated 2 years ago
- ☆19Mar 16, 2025Updated last year
- Machine learning algorithms implemented with JAX for production machine learning on large-scale datasets.☆16Updated this week
- Contains JAX implementation of algorithms for inverse reinforcement learning☆78Aug 18, 2024Updated last year
- ☆29Dec 15, 2025Updated 5 months ago
- 100 days of building GPU kernels!☆598Apr 27, 2025Updated last year
- A list of active Hack Club open source repositories with available issues on GitHub☆14Apr 28, 2026Updated last month
- This is the code corresponding to our publication introducing ConvDecoder with physics-based regularization (CD+r) for MRI☆10Feb 6, 2026Updated 3 months ago
- ☆12Oct 6, 2023Updated 2 years ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆17Apr 22, 2025Updated last year
- Code and report for APMA136 Final Project☆19May 6, 2015Updated 11 years ago
- Implementation of the paper on Embodiment Scaling Laws in Robot Locomotion (CoRL 2025)☆26Sep 23, 2025Updated 8 months ago
- Hack Club Bank CLI☆10Jul 25, 2022Updated 3 years ago
- [ICML 2025] Repository for M3-JEPA: Multimodal Alignment via Multi-gate MoE based on the Joint-Predictive Embedding Architecture☆29Mar 13, 2026Updated 2 months ago
- Website for Hack Club's Moonbeam project☆14Apr 4, 2022Updated 4 years ago
- ☆46May 20, 2026Updated last week
- ⌨️ the simplest and smallest code editor for web, with no dependencies - forked from spell 🪄☆14Nov 5, 2025Updated 6 months ago
- Analyze and model weekly calendar distributions using latent components☆21May 18, 2026Updated last week
- 🦘Websites upside down for those down under!☆13Nov 20, 2021Updated 4 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆20Sep 24, 2025Updated 8 months ago
- ☆10Oct 22, 2024Updated last year
- Classes and methods for Geometric Deep Learning to support Substack, LinkedIn newsletters and tutorials☆29May 14, 2026Updated 2 weeks ago
- Neural ODE Transformers (ICLR 2025)☆21Sep 6, 2025Updated 8 months ago
- Numerical relativity surrogate waveform in Jax☆19Aug 14, 2025Updated 9 months ago