chinaboard / whisperX-service
WhisperX Service love docker!
☆11Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for whisperX-service
- Whisper realtime streaming for long speech-to-text transcription and translation☆103Updated 9 months ago
- OpenAI Whisper API-style local server, runnig on FastAPI☆62Updated this week
- FastAPI service on top of WhisperX☆41Updated this week
- Pybind11 bindings for Whisper.cpp☆45Updated 2 weeks ago
- Speech Diarization for scrum automation☆97Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆84Updated 6 months ago
- streaming speech to text server using Whisper☆83Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆75Updated last year
- whisper.cpp bindings for python☆77Updated last year
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆95Updated this week
- ASR + diarization model server with speculative decoding☆50Updated 5 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆69Updated last month
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆49Updated 6 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆37Updated 4 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆57Updated last year
- ☆296Updated 4 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆53Updated 10 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆105Updated last year
- openvino version of openai/whisper☆161Updated last year
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆23Updated last month
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint.☆70Updated 3 weeks ago
- A pipeline parallel training script for LLMs.☆83Updated this week
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆54Updated 5 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆193Updated last week
- On-device streaming text-to-speech engine powered by deep learning☆56Updated 2 weeks ago
- Open source inference code for Rev's model☆333Updated last week
- ☆53Updated 5 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆196Updated 3 weeks ago
- Efficient approach to speaker diarization using voice characteristics extraction☆68Updated 6 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆110Updated 5 months ago