☆106Jun 11, 2026Updated this week
Alternatives and similar repositories for tpu-recipes
Users that are interested in tpu-recipes are comparing it to the libraries listed below.
- torchprime is a reference model implementation for PyTorch on TPU.☆47Mar 3, 2026Updated 3 months ago
- xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool to help Cloud developers orchestrate training jobs on accelerat…☆183May 14, 2026Updated last month
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆133Updated this week
- ☆12Nov 13, 2024Updated last year
- ☆15Apr 8, 2026Updated 2 months ago
- ☆15Oct 24, 2023Updated 2 years ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆445Jan 5, 2026Updated 5 months ago
- ☆34Oct 31, 2025Updated 7 months ago
- This repository is a collection of accelerated platform best practices, reference architectures, example use cases, reference implementat…☆101Updated this week
- Collection of OSS models that are containerized into a serving container☆16Sep 19, 2023Updated 2 years ago
- Accelerate and optimize performance with streamlined training and serving options in JAX.☆365Updated this week
- ☆583Jul 11, 2024Updated last year
- JAX backend for SGL☆280Updated this week
- A simple, performant and scalable Jax LLM!☆2,311Updated this week
- ☆49May 5, 2026Updated last month
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- 🎹 Instruct.KR 2025 Summer Meetup: Open-source LLMs, all the way to production with vLLM 🎹☆23Aug 2, 2025Updated 10 months ago
- ☆16Mar 13, 2025Updated last year
- ☆11Jul 30, 2025Updated 10 months ago
- Training code for Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 6 months ago
- Implementation of Spectral State Space Models☆16Feb 23, 2024Updated 2 years ago
- Minimal yet performant LLM examples in pure JAX☆256Apr 10, 2026Updated 2 months ago
- ☆15Jan 10, 2025Updated last year
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆29Mar 1, 2025Updated last year
- Google TPU optimizations for transformers models☆137Jan 23, 2026Updated 4 months ago
- Visual Inspection AI Edge solution infrastructure provisioning scripts☆17Nov 12, 2024Updated last year
- Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine☆254Updated this week
- Modular, scalable library to train ML models☆256May 29, 2026Updated 2 weeks ago
- This script automates the process of unlocking Apple ID accounts by solving captcha challenges, verifying account details, and resetting …☆14Jan 24, 2026Updated 4 months ago
- Contextualized per-token embeddings☆36Jun 5, 2026Updated last week
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 9 months ago
- ☆36Jun 6, 2026Updated last week
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe…☆64Updated this week
- BPE modification that removes intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- Free proxy, free VPN, a truly free VPN, shadowsocks, v2rey; official site www.dragonvpn.cc☆13Sep 4, 2024Updated last year
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference☆82Dec 18, 2025Updated 5 months ago
- Linux workshop for Enterprise administrators☆13Jun 15, 2022Updated 3 years ago
- ☆31Updated this week
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year