LiteRT-LM is Google's production-ready, high-performance, open-source inference framework for deploying Large Language Models on edge devices.
☆5,557 Jun 12, 2026 Updated this week
Alternatives and similar repositories for LiteRT-LM
Users that are interested in LiteRT-LM are comparing it to the libraries listed below.
- LiteRT, successor to TensorFlow Lite, is Google's on-device framework for high-performance ML & GenAI deployment on edge platforms, via e… ☆2,551 Updated this week
- A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally. ☆23,734 Updated this week
- Let's use Qualcomm NPU in Android ☆20 Feb 18, 2025 Updated last year
- Support PyTorch model conversion with LiteRT. ☆1,045 Updated this week
- Official inference framework for 1-bit LLMs ☆39,294 Mar 10, 2026 Updated 3 months ago
- AI Edge Quantizer: flexible post training quantization for LiteRT models. ☆156 Updated this week
- On-device AI across mobile, embedded and edge for PyTorch ☆4,733 Updated this week
- LLM inference in C/C++ ☆116,603 Updated this week
- FastRPC is Qualcomm's userspace library that facilitates efficient remote procedure calls between the CPU and DSP for high-performance co… ☆96 Jun 8, 2026 Updated last week
- an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM ☆49,724 Updated this week
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally. ☆66,620 Updated this week
- Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full… ☆17,874 Updated this week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆5,066 Updated this week
- ☆341 Jun 10, 2026 Updated last week
- lightweight, standalone C++ inference engine for Google's Gemma models. ☆6,949 Updated this week
- Open-Source Frontier Voice AI ☆49,426 May 6, 2026 Updated last month
- Universal LLM Deployment Engine with ML Compilation ☆22,792 May 11, 2026 Updated last month
- llama.cpp fork with additional SOTA quants and improved performance ☆2,737 Updated this week
- PersonaPlex code. ☆9,999 Mar 2, 2026 Updated 3 months ago
- Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. ☆173,937 Updated this week
- MCP Toolbox for Databases is an open source MCP server for databases. ☆15,602 Updated this week
- Python tool for converting files and office documents to Markdown. ☆152,866 May 26, 2026 Updated 3 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆83,135 Updated this week
- An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, s… ☆71,436 Updated this week
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml… ☆45 Updated this week
- Examples and guides for using the Gemini API ☆17,418 Jun 10, 2026 Updated last week
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec… ☆7,343 Jun 6, 2026 Updated last week
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste… ☆5,101 Updated this week
- Network for procedural editing of text with LLMs ☆23 Apr 28, 2026 Updated last month
- Run frontier AI locally. ☆45,365 Updated this week
- Lightweight coding agent that runs in your terminal ☆91,652 Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a… ☆50,785 Updated this week
- Tensor library for machine learning ☆14,804 Updated this week
- Port of OpenAI's Whisper model in C/C++ ☆50,829 Updated this week
- A simple screen parsing tool towards pure vision based GUI agent ☆24,915 Apr 13, 2026 Updated 2 months ago
- LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required. ☆46,908 Updated this week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆99,362 Updated this week
- Universal memory layer for AI Agents ☆58,750 Updated this week
- Run LLMs with MLX ☆5,825 Updated this week