☆110Jul 1, 2026Updated this week
Alternatives and similar repositories for tpu-recipes
Users that are interested in tpu-recipes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- torchprime is a reference model implementation for PyTorch on TPU.☆48Mar 3, 2026Updated 4 months ago
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆186Jun 25, 2026Updated last week
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆134Updated this week
- ☆20Nov 28, 2024Updated last year
- ☆15Oct 24, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆448Jan 5, 2026Updated 5 months ago
- This repository is a collection of accelerated platform best practices, reference architectures, example use cases, reference implementat…☆101Updated this week
- Collection of OSS models that are containerized into a serving container☆16Sep 19, 2023Updated 2 years ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆367Updated this week
- ☆590Jul 11, 2024Updated last year
- JAX backend for SGL☆295Updated this week
- ☆367Updated this week
- A simple, performant and scalable Jax LLM!☆2,338Jun 28, 2026Updated last week
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 11 months ago
- ☆16Mar 13, 2025Updated last year
- The training codes of Jasper-Token-Compression-600M☆20Nov 19, 2025Updated 7 months ago
- Implementation of Spectral State Space Models☆16Feb 23, 2024Updated 2 years ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆32Mar 1, 2025Updated last year
- Official code release for "SuperBPE: Space Travel for Language Models"☆96May 28, 2026Updated last month
- Google TPU optimizations for transformers models☆135Jan 23, 2026Updated 5 months ago
- ☆10Jul 18, 2018Updated 7 years ago
- Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine☆254Jun 22, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PathwaysJob API is an OSS Kubernetes-native API, to deploy ML training and batch inference workloads, using Pathways on GKE.☆20Oct 22, 2025Updated 8 months ago
- Modular, scalable library to train ML models☆277Updated this week
- This script automates the process of unlocking Apple ID accounts by solving captcha challenges, verifying account details, and resetting …☆14Jan 24, 2026Updated 5 months ago
- Contextualized per-token embeddings☆37Jun 23, 2026Updated last week
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 10 months ago
- ☆38Updated this week
- Go Wrappers for OpenXLA PJRT☆39Dec 12, 2025Updated 6 months ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- 免费梯子,免费VPN,真正免费的的VPN,shadowsocks,v2rey,官网地址www.dragonvpn.cc☆13Sep 4, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆84Dec 18, 2025Updated 6 months ago
- ☆16Apr 3, 2025Updated last year
- ☆31Jun 24, 2026Updated last week
- 🚀🤗 A collection of templates for Hugging Face Spaces☆34Oct 9, 2023Updated 2 years ago
- RAD Lab enables users to deploy infrastructure on Google Cloud Platform (GCP) to support specific use cases. Infrastructure is created an…☆114Jun 18, 2026Updated 2 weeks ago
- This is the code corresponding to our publication introducing ConvDecoder with physics-based regularization (CD+r) for MRI☆10Feb 6, 2026Updated 4 months ago
- A utility to inspect, validate, sign and verify machine learning model files.☆67Feb 5, 2025Updated last year