Minimal library to train LLMs on TPU in JAX with pjit().
☆301 · Dec 20, 2023 · Updated 2 years ago
Alternatives and similar repositories for jaxformer
Users interested in jaxformer are comparing it to the libraries listed below.
- Train very large language models in Jax. ☆210 · Oct 21, 2023 · Updated 2 years ago
- CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex. ☆5,170 · Oct 27, 2025 · Updated 4 months ago
- ☆63 · Mar 4, 2022 · Updated 4 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆695 · Jan 26, 2026 · Updated last month
- A simple, performant and scalable Jax LLM! ☆2,156 · Updated this week
- JMP is a Mixed Precision library for JAX. ☆211 · Jan 30, 2025 · Updated last year
- Machine Learning eXperiment Utilities ☆48 · Jul 29, 2025 · Updated 7 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆2,517 · Aug 13, 2024 · Updated last year
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta… ☆550 · Feb 26, 2026 · Updated last week
- ☆78 · Dec 7, 2023 · Updated 2 years ago
- jax-triton contains integrations between JAX and OpenAI Triton ☆439 · Feb 27, 2026 · Updated last week
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur… ☆561 · Jan 21, 2025 · Updated last year
- ☆367 · Apr 12, 2024 · Updated last year
- Task-based datasets, preprocessing, and evaluation for sequence models. ☆594 · Feb 3, 2026 · Updated last month
- CodeGen2 models for program synthesis ☆271 · Jun 12, 2023 · Updated 2 years ago
- This contains the Flax model of min(DALL·E) and code for converting it to PyTorch ☆45 · Jul 21, 2022 · Updated 3 years ago
- JAX-Toolbox ☆386 · Updated this week
- ☆16 · Jul 8, 2024 · Updated last year
- Training and serving large-scale neural networks with auto parallelization. ☆3,184 · Dec 9, 2023 · Updated 2 years ago
- ☆2,950 · Jan 15, 2026 · Updated last month
- CLU lets you write beautiful training loops in JAX. ☆367 · Feb 27, 2026 · Updated last week
- LoRA for arbitrary JAX models and functions ☆145 · Feb 26, 2024 · Updated 2 years ago
- JAX Synergistic Memory Inspector ☆184 · Jul 16, 2024 · Updated last year
- ☆193 · Feb 27, 2026 · Updated last week
- Implementation for ACProp (Momentum centering and asynchronous update for adaptive gradient methods, NeurIPS 2021) ☆16 · Oct 11, 2021 · Updated 4 years ago
- ☆13 · May 8, 2023 · Updated 2 years ago
- JAX implementation of the Mistral 7b v0.2 model ☆35 · Jul 3, 2024 · Updated last year
- OSLO: Open Source for Large-scale Optimization ☆175 · Sep 9, 2023 · Updated 2 years ago
- JAX implementation of the Llama 2 model ☆216 · Feb 2, 2024 · Updated 2 years ago
- ☆259 · Jun 6, 2025 · Updated 9 months ago
- Inference code for LLaMA models in JAX ☆120 · May 21, 2024 · Updated last year
- minGPT in JAX ☆48 · Jan 10, 2022 · Updated 4 years ago
- APPS: Automated Programming Progress Standard (NeurIPS 2021) ☆518 · Jun 19, 2024 · Updated last year
- ☆1,636 · Apr 27, 2023 · Updated 2 years ago
- JAX implementation ViT-VQGAN ☆82 · Sep 21, 2022 · Updated 3 years ago
- Pretrained deep learning models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc. ☆265 · Mar 21, 2025 · Updated 11 months ago
- Generative model for code infilling and synthesis ☆315 · Sep 9, 2023 · Updated 2 years ago
- A simple library for scaling up JAX programs ☆146 · Nov 4, 2025 · Updated 4 months ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries ☆7,395 · Feb 3, 2026 · Updated last month