Google TPU optimizations for transformers models
☆134Jan 23, 2026Updated last month
Alternatives and similar repositories for optimum-tpu
Users that are interested in optimum-tpu are comparing it to the libraries listed below
Sorting:
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆414Jan 5, 2026Updated 2 months ago
- ☆314Updated this week
- 🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.☆17Jun 5, 2025Updated 9 months ago
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆79Dec 18, 2025Updated 2 months ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆22Oct 18, 2023Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 5 months ago
- A collection of reusable, high-performance, well-documented, thorough-tested layers and models in Jax☆23Jun 8, 2025Updated 8 months ago
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)☆48Mar 2, 2024Updated 2 years ago
- A simple, performant and scalable Jax LLM!☆2,156Updated this week
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆170Updated this week
- StrategyQA 데이터 세트 번역☆23Apr 12, 2024Updated last year
- Machine Learning eXperiment Utilities☆48Jul 29, 2025Updated 7 months ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆36Oct 16, 2025Updated 4 months ago
- ☆16Jun 6, 2023Updated 2 years ago
- ☆27Dec 23, 2025Updated 2 months ago
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆37Aug 27, 2025Updated 6 months ago
- (EasyDel Former) is a utility library designed to simplify and enhance the development in JAX☆29Feb 21, 2026Updated last week
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- ☆192Feb 16, 2026Updated 2 weeks ago
- Replicate interface for IF☆10May 17, 2023Updated 2 years ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- A pytorch quantization backend for optimum☆1,030Nov 21, 2025Updated 3 months ago
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆549Feb 26, 2026Updated last week
- Tokamax: A GPU and TPU kernel library.☆179Updated this week
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆161Apr 3, 2024Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- 👷 Build compute kernels☆216Jan 27, 2026Updated last month
- hllama is a library which aims to provide a set of utility tools for large language models.☆10Apr 16, 2024Updated last year
- Unveiling the Layers: Neural Networks from first principles☆11Oct 1, 2025Updated 5 months ago
- An engine for fast time series data aggregation☆13Jan 8, 2026Updated last month
- ☆16Feb 18, 2026Updated 2 weeks ago
- Chunk Dedupe Estimation☆20Nov 5, 2024Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆282Jul 11, 2024Updated last year
- Bias, Hate classification with KoELECTRA 👿☆27Jun 12, 2023Updated 2 years ago
- TPU inference for vLLM, with unified JAX and PyTorch support.☆247Updated this week
- Training and inference on AWS Trainium and Inferentia chips.☆261Updated this week
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Oct 28, 2024Updated last year
- A lightweight Python package and command-line interface (CLI) tool that extracts audio from YouTube videos and playlists in multiple form…☆18Mar 5, 2025Updated last year