Mastering-Python-GT / Transcription-diarization-whisper-pyannote
Transcription and diarization (speaker identification)
☆28Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Transcription-diarization-whisper-pyannote
- Efficient approach to speaker diarization using voice characteristics extraction☆66Updated 6 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆83Updated 6 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆105Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆196Updated last week
- whisper.cpp bindings for python☆76Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆274Updated 9 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆166Updated last month
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆152Updated last month
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆133Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆114Updated last year
- A curated list of awesome OpenAI's Whisper☆93Updated last year
- A testing repo to share code and thoughts on diarisation☆51Updated 7 months ago
- Code for OpenAI Whisper Web App Demo☆95Updated 2 years ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆18Updated last month
- this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews☆45Updated 3 months ago
- Live transcription with OpenAi Whisper☆50Updated last year
- Runpod WhisperX Docker Container Repo☆11Updated 7 months ago
- TTS with The Massively Multilingual Speech (MMS) project☆226Updated 3 months ago
- Python bindings for whisper.cpp☆169Updated this week
- streaming speech to text server using Whisper☆83Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆139Updated 6 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆74Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆249Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆66Updated 3 weeks ago
- ☆152Updated last year
- Transcription with speaker diarization pipeline☆85Updated last year
- ☆77Updated 4 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆204Updated 4 months ago
- Langchain tools to search/extract/transcribe text transcripts of Youtube videos. Some of this has been integrated into LangChain main bra…☆61Updated last year
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆330Updated 10 months ago