Picovoice / picollmLinks
On-device LLM Inference Powered by X-Bit Quantization
☆272Updated 3 months ago
Alternatives and similar repositories for picollm
Users that are interested in picollm are comparing it to the libraries listed below
Sorting:
- On-device streaming text-to-speech engine powered by deep learning☆122Updated 2 months ago
- Recipes for on-device voice AI and local LLM☆98Updated 4 months ago
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated 9 months ago
- 1.58 Bit LLM on Apple Silicon using MLX☆225Updated last year
- A platform to self-host AI on easy mode☆173Updated this week
- Local ML voice chat using high-end models.☆177Updated 2 weeks ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆264Updated 8 months ago
- Running a LLM on the ESP32☆79Updated last year
- Run LLMs in the Browser with MLC / WebLLM ✨☆141Updated last year
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆340Updated 6 months ago
- ☆207Updated 2 months ago
- Locally running LLM with internet access☆97Updated 4 months ago
- Replace OpenAI with Llama.cpp Automagically.☆325Updated last year
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆620Updated last year
- Code for Papeg.ai☆225Updated 10 months ago
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆277Updated 2 months ago
- Open source LLM UI, compatible with all local LLM providers.☆176Updated last year
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon☆273Updated last year
- Running Microsoft's BitNet via Electron, React & Astro☆46Updated last month
- Something similar to Apple Intelligence?☆61Updated last year
- EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js over WebAssembly☆234Updated 6 months ago
- A fast batching API to serve LLM models☆188Updated last year
- ☆456Updated this week
- ☆133Updated 6 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆111Updated last year
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆217Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆191Updated last year
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference☆924Updated last month
- API Server for Transformer Lab☆78Updated this week
- ☆91Updated 5 months ago