☆3,144Apr 9, 2026Updated last week
Alternatives and similar repositories for LiteRT-LM
Users that are interested in LiteRT-LM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆2,143Apr 9, 2026Updated last week
- A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.☆20,307Apr 8, 2026Updated last week
- Let's use Qualcomm NPU in Android☆18Feb 18, 2025Updated last year
- Support PyTorch model conversion with LiteRT.☆988Apr 3, 2026Updated 2 weeks ago
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆116Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official inference framework for 1-bit LLMs☆38,049Mar 10, 2026Updated last month
- On-device AI across mobile, embedded and edge for PyTorch☆4,502Updated this week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆4,333Apr 9, 2026Updated last week
- LLM inference in C/C++☆103,237Updated this week
- ☆18Updated this week
- llama.cpp fork with additional SOTA quants and improved performance☆2,026Updated this week
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.☆61,312Updated this week
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml…☆38Jul 14, 2025Updated 9 months ago
- an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM☆42,084Updated this week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,846Apr 8, 2026Updated last week
- Run LLMs with MLX☆4,654Apr 8, 2026Updated last week
- The open-source managed agents platform. Turn coding agents into real teammates — assign tasks, track progress, compound skills.☆3,615Apr 9, 2026Updated last week
- Tensor library for machine learning☆14,394Apr 9, 2026Updated last week
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆6,641Apr 7, 2026Updated last week
- Lightweight in-memory knowledge graph with Cypher query support☆19Updated this week
- Examples and guides for using the Gemini API☆16,991Apr 9, 2026Updated last week
- Universal LLM Deployment Engine with ML Compilation☆22,414Apr 6, 2026Updated last week
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste…☆4,887Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Open-Source Frontier Voice AI☆39,575Updated this week
- ☆270Mar 31, 2026Updated 2 weeks ago
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆169,151Updated this week
- Python tool for converting files and office documents to Markdown.☆100,294Mar 30, 2026Updated 2 weeks ago
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)☆855Apr 3, 2026Updated last week
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆17Mar 6, 2026Updated last month
- FastRPC is Qualcomm's userspace library that facilitates efficient remote procedure calls between the CPU and DSP for high-performance co…☆84Apr 3, 2026Updated last week
- Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full…☆13,436Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆24,647Updated this week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆91Feb 14, 2026Updated 2 months ago
- Run frontier AI locally.☆43,503Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆76,536Updated this week
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,309Updated this week
- An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, s…☆61,646Updated this week
- MCP Toolbox for Databases is an open source MCP server for databases.☆14,044Updated this week
- Port of OpenAI's Whisper model in C/C++☆48,661Mar 29, 2026Updated 2 weeks ago