LiteRT, successor to TensorFlow Lite, is Google's on-device framework for high-performance ML & GenAI deployment on edge platforms, via efficient conversion, runtime, and optimization
☆2,349May 7, 2026Updated this week
Alternatives and similar repositories for LiteRT
Users that are interested in LiteRT are comparing it to the libraries listed below.
- Support PyTorch model conversion with LiteRT.☆1,015May 1, 2026Updated last week
- ☆294Apr 30, 2026Updated last week
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆128May 1, 2026Updated last week
- On-device AI across mobile, embedded and edge for PyTorch☆4,567Updated this week
- ☆4,820Updated this week
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,332Updated this week
- A modern model graph visualizer and debugger☆1,463Updated this week
- Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digit…☆2,881Apr 22, 2026Updated 2 weeks ago
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆405Updated this week
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆139Updated this week
- TFLite Support is a toolkit that helps users develop ML and deploy TFLite models onto mobile / IoT devices.☆436Mar 19, 2026Updated last month
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆68Sep 22, 2024Updated last year
- ☆15Dec 4, 2024Updated last year
- Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) an…☆1,020Updated this week
- A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.☆22,751Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆20,409Updated this week
- ☆2,687Apr 9, 2026Updated 3 weeks ago
- Cross-platform, customizable ML solutions for live and streaming media.☆35,058Updated this week
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆4,228May 1, 2026Updated last week
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,748Updated this week
- ☆18Jul 22, 2025Updated 9 months ago
- On-device Neural Engine☆564Apr 26, 2026Updated last week
- Tensor library for machine learning☆14,594Updated this week
- This repository is an implementation of converting the YOLOv10 object detection model to LiteRT (.tflite) format and deploy it on Android…☆36Sep 25, 2024Updated last year
- Low-latency AI engine for mobile devices & wearables☆4,706Updated this week
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,884Apr 28, 2026Updated last week
- Fast Multimodal LLM on Mobile Devices☆1,497Apr 30, 2026Updated last week
- Universal LLM Deployment Engine with ML Compilation☆22,598Apr 22, 2026Updated 2 weeks ago
- MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.☆15,126Updated this week
- Official inference framework for 1-bit LLMs☆38,861Mar 10, 2026Updated last month
- Let's use Qualcomm NPU in Android☆20Feb 18, 2025Updated last year
- LLM inference in C/C++☆107,892May 2, 2026Updated last week
- A tool for converting ONNX files to LiteRT/TFLite/TensorFlow, PyTorch native code (nn.Module), TorchScript (.pt), state_dict (.pt), Expor…☆956Apr 1, 2026Updated last month
- ☆562Updated this week
- Development repository for the Triton language and compiler☆19,124Updated this week
- A Python library for converting PyTorch modules into a circle model that is a lightweight and efficient representation in ONE designed fo…☆16Updated this week
- TT-NN operator library, and TT-Metalium low level kernel programming model.☆1,442Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆78,979Updated this week
- Examples for using ONNX Runtime for machine learning inferencing.☆1,638Feb 24, 2026Updated 2 months ago