meronym / speaker-transcription
Transcription with speaker diarization pipeline
☆86Updated last year
Related projects ⓘ
Alternatives and complementary repositories for speaker-transcription
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆84Updated 6 months ago
- ☆152Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆196Updated 3 weeks ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆169Updated 2 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆133Updated last year
- streaming speech to text server using Whisper☆83Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆105Updated last year
- A testing repo to share code and thoughts on diarisation☆53Updated 7 months ago
- generate granular word-level captions in srt format☆57Updated 2 years ago
- Performant and accurate speech recognition built on Pytorch☆248Updated 2 years ago
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆186Updated 5 months ago
- A curated list of awesome OpenAI's Whisper☆93Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆84Updated last month
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆138Updated 4 months ago
- ☆253Updated 5 months ago
- The code for some apps built with Sieve.☆71Updated last month
- Faster Tortoise inference then Tortoise Fast Fork☆122Updated 7 months ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆65Updated 2 years ago
- ☆347Updated 8 months ago
- Audio datasets, easier.☆83Updated last year
- ☆32Updated last year
- Python bindings for whisper.cpp☆216Updated 5 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆129Updated last year
- ☆87Updated 6 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆444Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆275Updated last week
- Joint speech-language model - respond directly to audio!☆30Updated 6 months ago
- ☆84Updated last year
- whisper.cpp bindings for python☆77Updated last year