☆79Mar 18, 2026Updated this week
Alternatives and similar repositories for tpu-recipes
Users that are interested in tpu-recipes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆172Updated this week
- ☆12Nov 13, 2024Updated last year
- torchprime is a reference model implementation for PyTorch on TPU.☆46Mar 3, 2026Updated 3 weeks ago
- ☆14Mar 2, 2026Updated 3 weeks ago
- ☆15Oct 24, 2023Updated 2 years ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆416Jan 5, 2026Updated 2 months ago
- Automated Quality Control for Dialogflow CX Agents☆14May 3, 2024Updated last year
- ☆81Mar 16, 2026Updated last week
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆342Updated this week
- Collection of OSS models that are containerized into a serving container☆16Sep 19, 2023Updated 2 years ago
- JAX backend for SGL☆252Updated this week
- ☆570Jul 11, 2024Updated last year
- ☆318Updated this week
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 7 months ago
- A simple, performant and scalable Jax LLM!☆2,182Updated this week
- ☆16Mar 13, 2025Updated last year
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago
- BioM-Transformers: Building Large Biomedical Language Models with BERT, ALBERT and ELECTRA☆37Jan 27, 2024Updated 2 years ago
- Minimal yet performant LLM examples in pure JAX☆245Jan 14, 2026Updated 2 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆28Mar 1, 2025Updated last year
- ☆15Jan 10, 2025Updated last year
- Official code release for "SuperBPE: Space Travel for Language Models"☆90Jan 9, 2026Updated 2 months ago
- Google TPU optimizations for transformers models☆136Jan 23, 2026Updated 2 months ago
- ☆10Jul 18, 2018Updated 7 years ago
- PathwaysJob API is an OSS Kubernetes-native API, to deploy ML training and batch inference workloads, using Pathways on GKE.☆19Oct 22, 2025Updated 5 months ago
- ☆33Feb 4, 2026Updated last month
- This script automates the process of unlocking Apple ID accounts by solving captcha challenges, verifying account details, and resetting …☆14Jan 24, 2026Updated last month
- ☆34Sep 10, 2024Updated last year
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 6 months ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆79Dec 18, 2025Updated 3 months ago
- Linux workshop for Enterprise administrators☆13Jun 15, 2022Updated 3 years ago
- ☆27Updated this week
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Oct 9, 2023Updated 2 years ago
- RAD Lab enables users to deploy infrastructure on Google Cloud Platform (GCP) to support specific use cases. Infrastructure is created an…☆112Updated this week
- A Template for MLOps on Google Cloud Vertex AI☆13Mar 16, 2022Updated 4 years ago
- A place to store scripts and things☆13May 1, 2025Updated 10 months ago