pavelzbornik / whisperX-FastAPI
FastAPI service on top of WhisperX
☆71 · Updated last week
Alternatives and similar repositories for whisperX-FastAPI:
Users who are interested in whisperX-FastAPI are comparing it to the libraries listed below.
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 · Updated 10 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆91 · Updated 10 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆112 · Updated last year
- Open source inference code for Rev's model ☆383 · Updated last week
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆194 · Updated last month
- Live-Transcription (STT) with Whisper PoC ☆174 · Updated 8 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆205 · Updated 4 months ago
- ez audio transcription tool with flexible processing and post-processing options ☆146 · Updated last year
- Have a natural voice conversation with an LLM ☆243 · Updated 3 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3! ☆250 · Updated 4 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆44 · Updated 3 weeks ago
- web based editor for subtitles and transcripts ☆123 · Updated 6 months ago
- whisper.cpp bindings for python ☆89 · Updated last year
- Simulates talk with an AI that can express emotions ☆56 · Updated 7 months ago
- ☆316 · Updated 8 months ago
- Speech Diarization for scrum automation ☆102 · Updated last year
- A lightweight end-to-end text-to-speech model ☆110 · Updated 2 weeks ago
- Whisperx API implementation ☆25 · Updated 10 months ago
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription ☆69 · Updated 9 months ago
- WhisperX Service love docker! ☆13 · Updated 6 months ago
- ASR + diarization model server with speculative decoding ☆58 · Updated 9 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆154 · Updated 7 months ago
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint. ☆87 · Updated last month
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ☆60 · Updated last year
- A simple TTS server for generating speech using StyleTTS2 ☆36 · Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. ☆96 · Updated 3 weeks ago
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆112 · Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles ☆77 · Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ☆50 · Updated 7 months ago
- A JS web client for connecting to Pipecat bots with voice and vision ☆43 · Updated 2 months ago