AIWintermuteAI / whispercpp
Pybind11 bindings for Whisper.cpp
☆57Updated last month
Alternatives and similar repositories for whispercpp
Users who are interested in whispercpp are comparing it to the libraries listed below
- Whisper realtime streaming for long speech-to-text transcription and translation☆116Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆36Updated last year
- streaming speech to text server using Whisper☆92Updated 2 years ago
- faster-whisper as serverless endpoint☆102Updated last week
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆45Updated last year
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆43Updated 8 months ago
- Something similar to Apple Intelligence?☆60Updated 10 months ago
- Joint speech-language model - respond directly to audio!☆368Updated 11 months ago
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs☆43Updated last year
- A simple experiment on letting two local LLMs have a conversation about anything!☆108Updated 10 months ago
- Port of Suno AI's Bark in C/C++ for fast inference☆51Updated last year
- A simple TTS server for generating speech using StyleTTS2☆38Updated last year
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆115Updated last year
- ☆53Updated last year
- ☆51Updated 8 months ago
- On-device speaker recognition engine powered by deep learning☆35Updated 3 weeks ago
- FastAPI service on top of WhisperX☆101Updated this week
- ☆156Updated last year
- whisper.cpp bindings for python☆96Updated last year
- Gradio-based tool to run open-source LLM models directly from Hugging Face☆91Updated 11 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆56Updated last month
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆63Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆84Updated 3 weeks ago
- OpenAI Whisper API-style local server, running on FastAPI☆80Updated 5 months ago
- Self-hosted, high-quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- ☆157Updated 10 months ago
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search☆39Updated last year
- Complex RAG backend☆28Updated last year
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆119Updated last year
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year