google-ai-edge / LiteRT-LMLinks
☆477Updated this week
Alternatives and similar repositories for LiteRT-LM
Users that are interested in LiteRT-LM are comparing it to the libraries listed below
Sorting:
- Welcome to the official repository of SINQ! A novel, fast and high-quality quantization method designed to make any Large Language Model …☆570Updated 2 weeks ago
- ☆702Updated last month
- LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆943Updated this week
- Sparse Inferencing for transformer based LLMs☆201Updated 3 months ago
- A command-line interface tool for serving LLM using vLLM.☆441Updated 3 weeks ago
- ☆315Updated this week
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B☆155Updated last week
- Train Large Language Models on MLX.☆213Updated last week
- ☆300Updated 3 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆275Updated 4 months ago
- ☆158Updated last week
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆343Updated 7 months ago
- Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere☆681Updated this week
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference☆939Updated last month
- FastMLX is a high performance production ready API to host MLX models.☆332Updated 8 months ago
- 1.58 Bit LLM on Apple Silicon using MLX☆225Updated last year
- Kyutai with an "eye"☆223Updated 7 months ago
- Fast parallel LLM inference for MLX☆227Updated last year
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗☆689Updated 4 months ago
- An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.☆380Updated 3 months ago
- LM Studio Apple MLX engine☆816Updated last week
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon☆274Updated last week
- Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages☆1,058Updated last week
- Qwen Image models through MPS☆220Updated 2 weeks ago
- API Server for Transformer Lab☆78Updated this week
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆223Updated 2 weeks ago
- LLM inference in C/C++☆103Updated last week
- Docs for GGUF quantization (unofficial)☆312Updated 3 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆201Updated 2 weeks ago
- Big & Small LLMs working together☆1,194Updated last week