QuentinFuxa / WhisperLiveKit
Real-time, Fully Local Speech-to-Text and Speaker Diarization. FastAPI Server & Web Interface
☆159Updated this week
Alternatives and similar repositories for WhisperLiveKit:
Users that are interested in WhisperLiveKit are comparing it to the libraries listed below
- ☆1,609Updated this week
- Whisper realtime streaming for long speech-to-text transcription and translation☆2,656Updated 2 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆376Updated 7 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆639Updated 3 months ago
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS☆837Updated 5 months ago
- A nearly-live implementation of OpenAI's Whisper.☆2,628Updated last month
- FastAPI service on top of WhisperX☆77Updated this week
- A python package to build AI-powered real-time audio applications☆1,221Updated last month
- ☆571Updated 10 months ago
- Interface for OuteTTS models.☆957Updated last month
- Python bindings for whisper.cpp☆236Updated this week
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆316Updated 4 months ago
- Local SRT/LLM/TTS Voicechat☆648Updated 5 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆205Updated 5 months ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆247Updated last week
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year
- G2P☆182Updated this week
- Live-Transcription (STT) with Whisper PoC☆175Updated 9 months ago
- Open source inference code for Rev's model☆389Updated 3 weeks ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆153Updated last year
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆104Updated 10 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆189Updated last month
- Real-time transcription using faster-whisper☆454Updated 8 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆195Updated last month
- Whisper command line client compatible with original OpenAI client based on CTranslate2.☆996Updated last month
- Command Your World with Voice☆621Updated 3 months ago
- A Fast TTS Engine☆471Updated 2 months ago
- OpenAI compatible TTS for Sesame CSM:1b - Voice Cloning from File/YT☆227Updated this week
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆126Updated 9 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆254Updated 4 months ago