chinaboard / whisperX-service
WhisperX Service love docker!
☆13Updated 7 months ago
Alternatives and similar repositories for whisperX-service:
Users that are interested in whisperX-service are comparing it to the libraries listed below
- streaming speech to text server using Whisper☆90Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year
- FastAPI service on top of WhisperX☆76Updated this week
- Pybind11 bindings for Whisper.cpp☆54Updated 3 weeks ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 10 months ago
- Speech Diarization for scrum automation☆102Updated last year
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint.☆91Updated last month
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆33Updated this week
- Real-Time Whisper Voice Recognition with vosk model feedback.☆112Updated last year
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆45Updated 5 months ago
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆60Updated 10 months ago
- whisper.cpp bindings for python☆92Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆25Updated 7 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆59Updated last year
- OpenAI Whisper API-style local server, runnig on FastAPI☆76Updated 3 months ago
- ☆318Updated 8 months ago
- A streaming whisper server for on-prem transcription☆20Updated 7 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆77Updated last year
- Transcription and diarization (speaker identification)☆31Updated last year
- ASR + diarization model server with speculative decoding☆59Updated 10 months ago
- ☆36Updated 2 years ago
- Running the F5-TTS by ONNX Runtime☆129Updated this week
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆61Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆126Updated 9 months ago
- ☆24Updated last year
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆28Updated 3 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆117Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆73Updated last week
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated 11 months ago
- Triton backend for https://github.com/OpenNMT/CTranslate2☆34Updated last year