Jose-Sabater / whisper-pyannote
Whisper from OpenAI and diarization with Pyannote
☆36 Updated last year
Alternatives and similar repositories for whisper-pyannote:
Users who are interested in whisper-pyannote are comparing it to the libraries listed below:
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆190 Updated last week
- ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆205 Updated 3 months ago
- Transcription and diarization (speaker identification) ☆31 Updated last year
- ☆94 Updated 9 months ago
- RunPod worker of the faster-whisper model for Serverless Endpoint. ☆84 Updated 2 weeks ago
- Transcription with speaker diarization pipeline ☆90 Updated last year
- ☆154 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆92 Updated 9 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆88 Updated 9 months ago
- A testing repo to share code and thoughts on diarisation ☆53 Updated 10 months ago
- Simulates talk with an AI that can express emotions ☆54 Updated 6 months ago
- whisper.cpp bindings for python ☆87 Updated last year
- streaming speech to text server using Whisper ☆86 Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ☆60 Updated last year
- Python bindings for whisper.cpp ☆221 Updated this week
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app ☆198 Updated 8 months ago
- Site for sharing Bark voices ☆48 Updated 7 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles ☆77 Updated last year
- Python bindings for whisper.cpp ☆228 Updated 8 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptions ☆46 Updated 2 years ago
- FastAPI service on top of WhisperX ☆68 Updated 3 weeks ago
- a simple system for 2-way interruptible voice interactions between human and LLM ☆23 Updated last year
- ☆200 Updated 4 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆306 Updated 3 months ago
- ☆35 Updated 2 years ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote ☆192 Updated this week
- Transcription and Diarization based on OpenAI's Whisper ☆21 Updated last year
- Faster Tortoise inference than Tortoise Fast Fork ☆128 Updated 10 months ago
- ☆560 Updated 9 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆135 Updated last year