Minimal but scalable implementation of large language models in JAX
☆35Nov 28, 2025Updated 3 months ago
Alternatives and similar repositories for mintext
Users interested in mintext are comparing it to the libraries listed below.
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆24Sep 29, 2024Updated last year
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆21Sep 24, 2025Updated 5 months ago
- Minimal yet performant LLM examples in pure JAX☆240Jan 14, 2026Updated last month
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Dec 27, 2023Updated 2 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆25Jan 14, 2025Updated last year
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Jul 24, 2025Updated 7 months ago
- A simple library for scaling up JAX programs☆146Nov 4, 2025Updated 3 months ago
- JAX Synergistic Memory Inspector☆184Jul 16, 2024Updated last year
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆23Oct 28, 2024Updated last year
- Data-Driven NetHack Tools: Datasets (30+) and recurrent baselines (AWAC, BC, CQL, IQL, REM)☆43Aug 22, 2023Updated 2 years ago
- A scalable benchmark for state representation learning in visual reinforcement learning.☆16Jun 23, 2025Updated 8 months ago
- Tensor Parallelism with JAX + Shard Map☆11Sep 29, 2023Updated 2 years ago
- Frechet inception distance (FID) evaluation in JAX☆14May 28, 2024Updated last year
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆17Nov 24, 2025Updated 3 months ago
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…☆12Feb 23, 2024Updated 2 years ago
- A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/…☆34Mar 4, 2025Updated 11 months ago
- Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"☆34Sep 18, 2024Updated last year
- GULAG: GUessing LAnGuages with neural networks☆13May 4, 2022Updated 3 years ago
- Pointax: PointMaze Environment for JAX☆26Oct 22, 2025Updated 4 months ago
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- seqax = sequence modeling + JAX☆171Jul 23, 2025Updated 7 months ago
- Modern, minimal, and modular LaTeX CV template ✨ 📄☆27Dec 4, 2025Updated 2 months ago
- A comprehensive JAX/NNX library for diffusion and flow matching generative algorithms, featuring DiT (Diffusion Transformer) and its vari…☆137Oct 16, 2025Updated 4 months ago
- A set of Python scripts that makes your experience on TPU better☆56Sep 18, 2025Updated 5 months ago
- Official codebase for "Sampling For Learnability", published at NeurIPS 2024☆20Oct 21, 2025Updated 4 months ago
- Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025☆33Jul 8, 2025Updated 7 months ago
- Two implementations of ZeRO-1 optimizer sharding in JAX☆14Jun 11, 2023Updated 2 years ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆41Jun 6, 2024Updated last year
- Custom Triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year