reriiasu / speech-to-text
Real-time transcription using faster-whisper
☆402Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for speech-to-text
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆122Updated last year
- Massive open Japanese speech corpus☆242Updated last month
- Live-Transcription (STT) with Whisper PoC☆152Updated 4 months ago
- A nearly-live implementation of OpenAI's Whisper.☆2,018Updated this week
- Project that allows one to use a microphone with OpenAI whisper.☆717Updated 4 months ago
- ☆233Updated last year
- Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.☆742Updated 2 months ago
- A python package to build AI-powered real-time audio applications☆1,072Updated 4 months ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2.☆909Updated this week
- ☆454Updated 3 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆274Updated 9 months ago
- 🥰 Building AI-based conversational avatars lightning fast ⚡️💬☆257Updated 5 months ago
- 💠 Aivis: AI Voice Imitation System☆143Updated 3 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆2,039Updated 3 weeks ago
- a gradio webui for faster whisper☆230Updated last year
- A quick experiment to achieve almost realtime transcription using Whisper.☆185Updated 2 years ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆330Updated 10 months ago
- Real time speech to text transcription app.☆387Updated last year
- A multi-speaker, multilingual speech generation tool☆156Updated last year
- Python bindings for whisper.cpp☆169Updated this week
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆174Updated 2 months ago
- AITuber Kit☆285Updated this week
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆323Updated 5 months ago
- 文章から感情豊かな音声を生成する Bert-VITS2 を簡単に使えます。☆137Updated 10 months ago
- Transcription, forced alignment, and audio indexing with OpenAI's Whisper☆1,561Updated this week
- ☆294Updated 4 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆249Updated 2 months ago
- Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRCha…☆397Updated last week
- Real-Time Whisper Voice Recognition with vosk model feedback.☆105Updated last year
- Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)☆227Updated 3 weeks ago