google-ai-edge / LiteRT-LM
☆728 · Updated this week
Alternatives and similar repositories for LiteRT-LM
Users who are interested in LiteRT-LM are comparing it to the repositories listed below.
- ☆170 · Updated last week
- Welcome to the official repository of SINQ! A novel, fast and high-quality quantization method designed to make any Large Language Model … ☆587 · Updated 3 weeks ago
- ☆720 · Updated last month
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference ☆977 · Updated last month
- A command-line interface tool for serving LLMs using vLLM. ☆461 · Updated last month
- ☆431 · Updated last month
- Examples, end-to-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK ☆855 · Updated this week
- 🎯 An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality degradation across Weight-Only Quantiza… ☆806 · Updated last week
- On-device LLM Inference Powered by X-Bit Quantization ☆276 · Updated this week
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆277 · Updated 6 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆347 · Updated 9 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI ☆356 · Updated last week
- A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes… ☆331 · Updated this week
- Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages ☆2,572 · Updated 2 weeks ago
- ☆159 · Updated last month
- Awesome Mobile LLMs ☆290 · Updated last month
- ☆461 · Updated last week
- Community maintained hardware plugin for vLLM on Apple Silicon ☆260 · Updated this week
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B ☆560 · Updated last month
- Train Large Language Models on MLX. ☆240 · Updated last week
- ☆2,256 · Updated last month
- No-code CLI designed for accelerating ONNX workflows ☆224 · Updated 7 months ago
- Building blocks for agents in C++ ☆131 · Updated this week
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆459 · Updated 4 months ago
- ☆301 · Updated 5 months ago
- Real Time Speech Transcription with FastRTC ⚡️ and Local Whisper 🤗 ☆694 · Updated 6 months ago
- Sparse Inferencing for transformer-based LLMs ☆217 · Updated 5 months ago
- A powerful Python library for creating and managing isolated desktop environments using Docker containers. ☆443 · Updated 4 months ago
- Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https… ☆2,008 · Updated this week
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime ☆414 · Updated this week