☆877Mar 6, 2026Updated this week
Alternatives and similar repositories for LiteRT-LM
Users that are interested in LiteRT-LM are comparing it to the libraries listed below
Sorting:
- LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆1,527Updated this week
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆102Updated this week
- Let's use Qualcomm NPU in Android☆18Feb 18, 2025Updated last year
- Support PyTorch model conversion with LiteRT.☆944Feb 28, 2026Updated last week
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 4 months ago
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆17Updated this week
- A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.☆15,213Feb 26, 2026Updated last week
- llama.cpp fork with additional SOTA quants and improved performance☆1,696Feb 28, 2026Updated last week
- A high-performance, thread-safe HashMap and LRU cache for Rust with fine-grained per-key locking.☆10Updated this week
- splits videos into scenes with gpt-4o-mini and saves them separately☆12Dec 19, 2024Updated last year
- Knox is a vigilant supervisor and management tool that ensures LLM teams rigorously develop reliable AI Agent programming extensions for …☆33Feb 25, 2026Updated last week
- An agentic runtime that enables secure, extensible and configurable AI automation from any model☆17Updated this week
- python越南语分词器☆10Nov 14, 2019Updated 6 years ago
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml…☆36Jul 14, 2025Updated 7 months ago
- 🔌 Want one client library for all your embeddings? 💙 Choose Catsu! 🐱☆60Feb 20, 2026Updated 2 weeks ago
- AI debugger and AI coder integrated. Use AI to code and drives runtime debugger☆84Nov 25, 2025Updated 3 months ago
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc…☆10Oct 29, 2025Updated 4 months ago
- A chat UI for Llama.cpp☆15Dec 2, 2025Updated 3 months ago
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆33Oct 3, 2024Updated last year
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)☆821Feb 23, 2026Updated last week
- LLamaHTML is a simple html file to communicate with a running llamacpp llama-server☆22Aug 5, 2025Updated 7 months ago
- win32 native frontend for llama-cli☆12Nov 2, 2024Updated last year
- ☆15Feb 16, 2025Updated last year
- A cross-platform GUI application for easily downloading Hugging Face models without requiring technical knowledge or setup.☆23Nov 26, 2025Updated 3 months ago
- Ready-to-use agent that can interact directly with any tool or native endpoint, in less than 5 lines of code☆45Oct 16, 2025Updated 4 months ago
- Code for paper https://arxiv.org/abs/2501.00522☆14Apr 28, 2025Updated 10 months ago
- This Streamlit application allows users to upload images and engage in interactive conversations about them using the Ollama Vision Model…☆15Nov 11, 2024Updated last year
- ☆249Feb 26, 2026Updated last week
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Feb 14, 2026Updated 3 weeks ago
- Let's sudo by face recognition of Windows Hello on Windows Subsystem for Linux (WSL). It runs on both WSL 1 and WSL 2. This is a PAM modu…☆24Updated this week
- LLVM compiler for python☆12Dec 7, 2024Updated last year
- Baker is an AI powered app that helps you find recipes and avoid food waste☆14Jan 4, 2025Updated last year
- ☆15May 27, 2020Updated 5 years ago
- Official inference framework for 1-bit LLMs☆28,697Feb 3, 2026Updated last month
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,267Updated this week
- Efficient non-uniform quantization with GPTQ for GGUF☆60Sep 17, 2025Updated 5 months ago
- 🤖 AI-powered CLI for file reorganization. Runs fully locally — no data leaves your machine.☆20Jul 2, 2025Updated 8 months ago
- ☆17Sep 24, 2024Updated last year
- Is a high-performance Augmented Recovery-Generation (RAG) solution based on Redis, Qdrant or PostgreSQL. It offers a high-level interface…☆30Jan 6, 2026Updated 2 months ago