AIWintermuteAI / whispercpp
Pybind11 bindings for Whisper.cpp
☆57Updated this week
Alternatives and similar repositories for whispercpp:
Users that are interested in whispercpp are comparing it to the libraries listed below
- Whisper realtime streaming for long speech-to-text transcription and translation☆114Updated last year
- streaming speech to text server using Whisper☆91Updated last year
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆115Updated 11 months ago
- faster-whisper as serverless endpoint☆96Updated this week
- Download models from the Ollama library, without Ollama☆70Updated 5 months ago
- FastAPI service on top of WhisperX☆92Updated this week
- Something similar to Apple Intelligence?☆60Updated 10 months ago
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆43Updated 7 months ago
- ☆153Updated 9 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 11 months ago
- An API for VoiceCraft.☆25Updated 10 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆48Updated 3 weeks ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- A simple TTS server for generating speech using StyleTTS2☆38Updated last year
- ☆204Updated 11 months ago
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆212Updated 3 months ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆36Updated last year
- All the world is a play, we are but actors in it.☆49Updated this week
- ☆25Updated last year
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆74Updated 3 months ago
- whisper.cpp bindings for python☆95Updated last year
- ☆112Updated 4 months ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆47Updated 9 months ago
- 5X faster 60% less memory QLoRA finetuning☆21Updated 11 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 4 months ago
- A fast batching API to serve LLM models☆182Updated last year
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs☆43Updated last year
- ASR + diarization model server with speculative decoding☆60Updated 11 months ago
- ☆96Updated last year