shirayu / whispering
Streaming transcriber with whisper
☆685Updated last year
Related projects: ⓘ
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆774Updated 4 months ago
- ☆443Updated last year
- A python package to build AI-powered real-time audio applications☆992Updated 2 months ago
- Transcription, forced alignment, and audio indexing with OpenAI's Whisper☆1,483Updated last week
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence☆1,865Updated last month
- Whisper realtime streaming for long speech-to-text transcription and translation☆1,770Updated 2 weeks ago
- Project that allows one to use a microphone with OpenAI whisper.☆700Updated 2 months ago
- Real time speech to text transcription app.☆379Updated last year
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆421Updated 10 months ago
- ☆486Updated 4 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆262Updated 7 months ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆322Updated 8 months ago
- A quick experiment to achieve almost realtime transcription using Whisper.☆185Updated last year
- Real time transcription with OpenAI Whisper.☆2,260Updated 3 months ago
- Pybind11 bindings for Whisper.cpp☆321Updated this week
- ☆431Updated 2 months ago
- ☆342Updated 6 months ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆234Updated last year
- How to use OpenAIs Whisper to transcribe and diarize audio files☆278Updated last year
- Fast TorToiSe inference (5x or your money back!)☆777Updated 2 months ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆315Updated 7 months ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2.☆866Updated last month
- A nearly-live implementation of OpenAI's Whisper.☆1,798Updated 2 weeks ago
- Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch☆1,168Updated this week
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone☆878Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision☆65Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆276Updated 3 weeks ago
- Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch☆1,263Updated 11 months ago
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆551Updated last year
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆3,969Updated last week