Picovoice / picollm
On-device LLM Inference Powered by X-Bit Quantization
☆234Updated 2 weeks ago
Alternatives and similar repositories for picollm:
Users that are interested in picollm are comparing it to the libraries listed below
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆181Updated 2 months ago
- On-device streaming text-to-speech engine powered by deep learning☆76Updated this week
- Recipes for on-device voice AI and local LLM☆81Updated last month
- Lightweight Inference server for OpenVINO☆160Updated last week
- Open source LLM UI, compatible with all local LLM providers.☆173Updated 7 months ago
- ☆130Updated last week
- ☆204Updated 10 months ago
- Run Orpheus 3B Locally With LM Studio☆379Updated last month
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆221Updated 3 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆260Updated 2 weeks ago
- EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js over WebAssembly☆150Updated 3 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆118Updated last year
- Interface for OuteTTS models.☆1,178Updated last week
- Awesome Mobile LLMs☆169Updated last month
- Replace OpenAI with Llama.cpp Automagically.☆315Updated 10 months ago
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with a more realistic Kokoro TTS voice and vision.☆56Updated 2 months ago
- A ggml (C++) re-implementation of tortoise-tts☆178Updated 8 months ago
- FastMLX is a high performance production ready API to host MLX models.☆293Updated last month
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆552Updated 2 months ago
- 1.58 Bit LLM on Apple Silicon using MLX☆200Updated 11 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆580Updated 5 months ago
- A fast batching API to serve LLM models☆182Updated 11 months ago
- Port of Suno AI's Bark in C/C++ for fast inference☆52Updated last year
- ☆198Updated this week
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆117Updated 6 months ago
- ML-powered speech synthesis directly in your browser☆147Updated 2 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆762Updated 8 months ago
- Train your own small bitnet model☆67Updated 6 months ago
- ☆46Updated 6 months ago
- A mobile Implementation of llama.cpp☆308Updated last year