Picovoice / picollm
On-device LLM Inference Powered by X-Bit Quantization
☆224Updated last week
Alternatives and similar repositories for picollm:
Users that are interested in picollm are comparing it to the libraries listed below
- On-device streaming text-to-speech engine powered by deep learning☆73Updated last week
- Recipes for on-device voice AI and local LLM☆79Updated last week
- A multimodal, function calling powered LLM webui.☆215Updated 6 months ago
- A fast batching API to serve LLM models☆183Updated 11 months ago
- Lightweight Inference server for OpenVINO☆142Updated this week
- ☆234Updated 4 months ago
- Falcon LLM ggml framework with CPU and GPU support☆246Updated last year
- Replace OpenAI with Llama.cpp Automagically.☆311Updated 9 months ago
- Efficient Inference of Transformer models☆427Updated 7 months ago
- A mobile Implementation of llama.cpp☆305Updated last year
- automatically quant GGUF models☆164Updated this week
- Local ML voice chat using high-end models.☆162Updated this week
- ☆201Updated 10 months ago
- Open source LLM UI, compatible with all local LLM providers.☆173Updated 6 months ago
- A ggml (C++) re-implementation of tortoise-tts☆178Updated 7 months ago
- Turns devices into a scalable LLM platform☆127Updated this week
- llama.cpp fork with additional SOTA quants and improved performance☆222Updated this week
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆243Updated 3 weeks ago
- AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models☆152Updated 10 months ago
- ☆197Updated last week
- ☆91Updated 2 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆547Updated last month
- FastMLX is a high performance production ready API to host MLX models.☆281Updated last week
- A Conversational Speech Generation Model with Gradio UI and support for CUDA, MLX and CPU devices☆131Updated this week
- ☆83Updated 3 months ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆71Updated last month
- Efficient visual programming for AI language models☆353Updated 6 months ago
- Run Orpheus 3B Locally With LM Studio☆277Updated last week
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆117Updated last year
- A local and uncensored AI entity.☆69Updated last month