PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference
☆79 Dec 18, 2025 Updated 2 months ago
Alternatives and similar repositories for jetstream-pytorch
Users who are interested in jetstream-pytorch compare it to the libraries listed below
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel… ☆413 Jan 5, 2026 Updated last month
- ☆19 Nov 5, 2025 Updated 3 months ago
- Google TPU optimizations for transformers models ☆134 Jan 23, 2026 Updated last month
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU) ☆64 Feb 5, 2026 Updated 3 weeks ago
- ☆15 Updated this week
- Latent Large Language Models ☆19 Aug 24, 2024 Updated last year
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems ☆21 Jul 4, 2025 Updated 7 months ago
- ☆28 Jun 3, 2024 Updated last year
- torchax is a PyTorch frontend for JAX. It gives JAX the ability to author JAX programs using familiar PyTorch syntax. It also provides JA… ☆178 Feb 21, 2026 Updated last week
- JAX backend for SGL ☆241 Updated this week
- A simple, performant and scalable Jax LLM! ☆2,148 Updated this week
- Minimal yet performant LLM examples in pure JAX ☆240 Jan 14, 2026 Updated last month
- An implementation of the Llama architecture, to instruct and delight ☆21 May 31, 2025 Updated 9 months ago
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe… ☆58 Updated this week
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud. ☆117 Updated this week
- PyTorch distributed training acceleration framework ☆54 Aug 13, 2025 Updated 6 months ago
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the … ☆264 Updated this week
- ☆45 Aug 13, 2025 Updated 6 months ago
- Starter template for your ML/AI projects (uv package manager, RestAPI with FastAPI and Dockerfile support) ☆33 Jan 13, 2025 Updated last year
- ☆27 Oct 26, 2024 Updated last year
- Resa: Transparent Reasoning Models via SAEs ☆47 Sep 23, 2025 Updated 5 months ago
- Mixed precision training from scratch with Tensors and CUDA ☆28 May 14, 2024 Updated last year
- FP4 MAC Array ☆19 Apr 14, 2024 Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆693 Jan 26, 2026 Updated last month
- ☆27 Dec 23, 2025 Updated 2 months ago
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta… ☆549 Updated this week
- JAX Implementations of Descript Audio Codec and EnCodec ☆33 Mar 30, 2025 Updated 11 months ago
- Agentic Research and Evaluation Suite ☆75 Updated this week
- An IR for efficiently simulating distributed ML computation. ☆32 Jan 13, 2024 Updated 2 years ago
- FlexAttention w/ FlashAttention3 Support ☆27 Oct 5, 2024 Updated last year
- ☆192 Feb 16, 2026 Updated last week
- A simple, easy-to-understand library for diffusion models using Flax and Jax. Includes detailed notebooks on DDPM, DDIM, and EDM with sim… ☆41 May 6, 2025 Updated 9 months ago
- torchprime is a reference model implementation for PyTorch on TPU. ☆46 Feb 18, 2026 Updated last week
- Project for training Korean language models (Flax, PyTorch with Hugging Face Accelerate) ☆32 Sep 13, 2023 Updated 2 years ago
- ☆39 Aug 1, 2025 Updated 7 months ago
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024) ☆35 Apr 25, 2024 Updated last year
- ☆34 May 14, 2025 Updated 9 months ago
- This repository contains example code to build models on TPUs ☆30 Feb 17, 2023 Updated 3 years ago
- Orbax provides common checkpointing and persistence utilities for JAX users ☆482 Updated this week