☆990 · Mar 26, 2026 · Updated this week
Alternatives and similar repositories for LiteRT-LM
Users that are interested in LiteRT-LM are comparing it to the libraries listed below.
- LiteRT, successor to TensorFlow Lite, is Google's on-device framework for high-performance ML & GenAI deployment on edge platforms, via e… ☆1,963 · Mar 20, 2026 · Updated last week
- Let's use Qualcomm NPU in Android ☆18 · Feb 18, 2025 · Updated last year
- Support PyTorch model conversion with LiteRT. ☆965 · Updated this week
- ☆185 · Mar 16, 2026 · Updated last week
- ☆16 · Feb 7, 2026 · Updated last month
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml… ☆37 · Jul 14, 2025 · Updated 8 months ago
- Local LLM Testing & Benchmarking for Apple Silicon ☆105 · Mar 19, 2026 · Updated last week
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :) ☆17 · Mar 6, 2026 · Updated 3 weeks ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK ☆91 · Feb 14, 2026 · Updated last month
- AI debugger and AI coder integrated. Use AI to code and drive the runtime debugger ☆86 · Mar 17, 2026 · Updated last week
- Self-hosted agentic AI platform powered by local models ☆38 · Feb 27, 2026 · Updated last month
- Embedding Inversion via Conditional Masked Diffusion: recover original text from embedding vectors using parallel denoising. Live demo + … ☆46 · Mar 7, 2026 · Updated 2 weeks ago
- ☆32 · Jul 5, 2024 · Updated last year
- MNN ASR demo. ☆26 · Mar 24, 2025 · Updated last year
- llama.cpp fork with additional SOTA quants and improved performance ☆1,846 · Mar 20, 2026 · Updated last week
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work. ☆33 · Oct 3, 2024 · Updated last year
- win32 native frontend for llama-cli ☆12 · Nov 2, 2024 · Updated last year
- High-efficiency floating-point neural network inference operators for mobile, server, and Web ☆2,281 · Updated this week
- ☆49 · Feb 19, 2026 · Updated last month
- Test-Time Memory Framework: Control Hallucinations in Foundation Models ☆11 · Nov 4, 2025 · Updated 4 months ago
- 🔌 Want one client library for all your embeddings? 💙 Choose Catsu! 🐱 ☆62 · Feb 20, 2026 · Updated last month
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU) ☆839 · Mar 19, 2026 · Updated last week
- Official PyTorch implementation of "GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance" (ICML 2025) ☆51 · Jul 6, 2025 · Updated 8 months ago
- splits videos into scenes with gpt-4o-mini and saves them separately ☆12 · Dec 19, 2024 · Updated last year
- Work-in-progress vector search SQLite extension that runs anywhere. ☆10 · Jul 27, 2024 · Updated last year
- ☆36 · Dec 16, 2025 · Updated 3 months ago
- lightweight, standalone C++ inference engine for Google's Gemma models. ☆6,755 · Mar 19, 2026 · Updated last week
- ☆11 · Feb 5, 2026 · Updated last month
- Code for paper https://arxiv.org/abs/2501.00522 ☆14 · Apr 28, 2025 · Updated 10 months ago
- Official Implementation of the 'When XGBoost Outperforms GPT-4 on Text Classification: A Case Study' NAACL-W 2024 paper ☆16 · Dec 16, 2024 · Updated last year
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc… ☆10 · Oct 29, 2025 · Updated 4 months ago
- Efficient non-uniform quantization with GPTQ for GGUF ☆63 · Sep 17, 2025 · Updated 6 months ago
- Stable Diffusion in pure C/C++ ☆62 · Aug 13, 2023 · Updated 2 years ago
- Official inference framework for 1-bit LLMs ☆35,906 · Mar 10, 2026 · Updated 2 weeks ago
- Qwen2-VL for OCR & VQA ☆19 · Sep 3, 2024 · Updated last year
- Thai Law Dataset (Act of Parliament) ☆23 · Jul 21, 2021 · Updated 4 years ago
- Train Large Language Models on MLX. ☆286 · Mar 11, 2026 · Updated 2 weeks ago
- Hands-on: using ChatGPT through a LINE bot ☆18 · Jun 28, 2023 · Updated 2 years ago
- A cross-platform library for communicating with devices over various physical links. ☆17 · Mar 2, 2026 · Updated 3 weeks ago