PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"
☆82Dec 18, 2025Updated 5 months ago
Alternatives and similar repositories for jetstream-pytorch
Users that are interested in jetstream-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆443Jan 5, 2026Updated 5 months ago
- Google TPU optimizations for transformers models☆137Jan 23, 2026Updated 4 months ago
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆64May 5, 2026Updated last month
- ☆21Apr 27, 2026Updated last month
- torchax is a PyTorch frontend for JAX. It gives JAX the ability to author JAX programs using familiar PyTorch syntax. It also provides JA…☆227May 19, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- a Jax quantization library☆120Updated this week
- This repository contains example code to build models on TPUs☆30Feb 17, 2023Updated 3 years ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- ☆56Apr 23, 2024Updated 2 years ago
- ☆19Oct 6, 2023Updated 2 years ago
- MLIR-based partitioning system☆190Updated this week
- JaxPP is a library for JAX that enables flexible MPMD pipeline parallelism for large-scale LLM training☆77Jun 3, 2026Updated last week
- ☆16Apr 10, 2022Updated 4 years ago
- A simple, performant and scalable Jax LLM!☆2,311Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems☆24Jul 4, 2025Updated 11 months ago
- Fast and easy distributed model training examples.☆12Nov 26, 2024Updated last year
- Android ORM framework.☆20Jul 1, 2015Updated 10 years ago
- Example of applying CUDA graphs to LLaMA-v2☆11Aug 25, 2023Updated 2 years ago
- ☆32Jun 3, 2024Updated 2 years ago
- JAX Implementations of Descript Audio Codec and EnCodec☆37Mar 30, 2025Updated last year
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- JAX backend for SGL☆280Updated this week
- Julia package for Probabilistic Canonical Correlation Analysis☆12Mar 30, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆104Updated this week
- ☆29Oct 26, 2024Updated last year
- An IR for efficiently simulating distributed ML computation.☆33Jan 13, 2024Updated 2 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆709Jan 26, 2026Updated 4 months ago
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆183May 14, 2026Updated 3 weeks ago
- ☆14Nov 28, 2022Updated 3 years ago
- ☆153May 29, 2026Updated last week
- ☆20May 30, 2026Updated last week
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆32Jun 5, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md☆25Dec 22, 2022Updated 3 years ago
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆294Updated this week
- 한국어 언어 모델 학습을 위한 프로젝트(Flax, Pytorch with Huggingface Accelerate)☆32Sep 13, 2023Updated 2 years ago
- Starter template for your ML/AI projects (uv package manager, RestAPI with FastAPI and Dockerfile support)☆35Jan 13, 2025Updated last year
- ☆34May 14, 2025Updated last year
- Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language☆11Apr 13, 2023Updated 3 years ago
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆133Updated this week