ayaka14732/llama-2-jax

JAX implementation of the Llama 2 model

☆217

Alternatives and similar repositories for llama-2-jax

Users who are interested in llama-2-jax compare it with the libraries listed below.

Sea-Snell / JAX_llama
Inference code for LLaMA models in JAX
☆120 · May 21, 2024 · Updated 2 years ago
yixiaoer / mistral-v0.2-jax
JAX implementation of the Mistral 7b v0.2 model
☆35 · Jul 3, 2024 · Updated 2 years ago
ayaka14732 / jax-smi
JAX Synergistic Memory Inspector
☆186 · Jul 16, 2024 · Updated 2 years ago
young-geng / scalax
A simple library for scaling up JAX programs
☆148 · Nov 4, 2025 · Updated 8 months ago
erfanzar / EasyDeL
Accelerate, Optimize performance with streamlined training and serving options with JAX.
☆369 · Updated this week
yixiaoer / einshard
Einsum-like high-level array sharding API for JAX
☆35 · Jul 16, 2024 · Updated 2 years ago
lucidrains / flash-attention-jax
Implementation of Flash Attention in Jax
☆229 · Mar 1, 2024 · Updated 2 years ago
google / paxml
Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…
☆555 · Updated this week
ayaka14732 / tpu-starter
Everything you want to know about Google Cloud TPU
☆571 · Jul 16, 2024 · Updated 2 years ago
irhum / esmjax
ESM2 protein language models in JAX/Flax
☆19 · Oct 10, 2022 · Updated 3 years ago
davisyoshida / lorax
LoRA for arbitrary JAX models and functions
☆143 · Feb 26, 2024 · Updated 2 years ago
jax-ml / jax-triton
jax-triton contains integrations between JAX and OpenAI Triton
☆465 · Updated this week
AI-Hypercomputer / maxtext
A simple, performant, and scalable Jax LLM!
☆2,368 · Updated this week
davisyoshida / qax
If it quacks like a tensor...
☆59 · Nov 13, 2024 · Updated last year
Sea-Snell / JAXSeq
Train very large language models in Jax.
☆208 · Oct 21, 2023 · Updated 2 years ago
google / jaxonnxruntime
A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.
☆135 · Jul 6, 2026 · Updated 3 weeks ago
young-geng / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆2,514 · Aug 13, 2024 · Updated last year
aniquetahir / JORA
JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)
☆36 · Apr 25, 2024 · Updated 2 years ago
marin-community / levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆708 · Jan 26, 2026 · Updated 6 months ago
evanatyourservice / llm-jax
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆19 · Jul 24, 2025 · Updated last year
yixiaoer / tpux
A set of Python scripts that makes your experience on TPU better
☆56 · Sep 18, 2025 · Updated 10 months ago
borisdayma / clip-jax
Train vision models using JAX and 🤗 transformers
☆103 · Dec 14, 2025 · Updated 7 months ago
patil-suraj / stable-diffusion-jax
☆91 · Sep 19, 2022 · Updated 3 years ago
pytorch-tpu / transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
☆17 · Jun 5, 2025 · Updated last year
lucidrains / PaLM-jax
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)
☆189 · Jun 24, 2022 · Updated 4 years ago
google / flaxformer
☆371 · Apr 12, 2024 · Updated 2 years ago
pcuenca / lpips-j
Minimal JAX/Flax port of `lpips` supporting `vgg16`, with pre-trained weights stored in the 🤗 Hugging Face hub.
☆17 · Aug 1, 2022 · Updated 3 years ago
google-deepmind / jmp
JMP is a Mixed Precision library for JAX.
☆213 · Jul 8, 2026 · Updated 3 weeks ago
erfanzar / eformer
(EasyDel Former) is a utility library designed to simplify and enhance the development in JAX
☆33 · Updated this week
kuprel / min-dalle-flax
This contains the Flax model of min(DALL·E) and code for converting it to PyTorch
☆45 · Jul 21, 2022 · Updated 4 years ago
google / drjax
☆19 · Jul 8, 2026 · Updated 3 weeks ago
ayaka14732 / bart-base-jax
JAX implementation of the bart-base model
☆34 · Apr 11, 2023 · Updated 3 years ago
AllanYangZhou / midGPT
Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.
☆27 · Sep 29, 2024 · Updated last year
google-research / jaxpruner
☆237 · Feb 12, 2025 · Updated last year
patil-suraj / vit-vqgan
JAX implementation ViT-VQGAN
☆82 · Sep 21, 2022 · Updated 3 years ago
google / orbax
Orbax provides common checkpointing and persistence utilities for JAX users
☆525 · Updated this week
vvvm23 / mamba-jax
Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX
☆94 · Jan 25, 2024 · Updated 2 years ago
kingoflolz / swarm-jax
Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes
☆241 · May 12, 2023 · Updated 3 years ago
google / flax
Flax is a neural network library for JAX that is designed for flexibility.
☆7,281 · Updated this week