Mastering-Python-GT / Transcription-diarization-whisper-pyannote
Transcription and diarization (speaker identification)
★33, updated 2 years ago
Alternatives and similar repositories for Transcription-diarization-whisper-pyannote
Users who are interested in Transcription-diarization-whisper-pyannote are comparing it to the repositories listed below.
- Efficient approach to speaker diarization using voice characteristics extraction (★94, updated last year)
- ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. (★213, updated 7 months ago)
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code (★149, updated last year)
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… (★217, updated last month)
- FastAPI service on top of WhisperX (★101, updated this week)
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts (★326, updated 6 months ago)
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface. (★308, updated 2 years ago)
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines (★94, updated last year)
- How to use OpenAI's Whisper to transcribe and diarize audio files (★343, updated 2 years ago)
- A minimalistic automatic speech recognition Streamlit-based web app powered by OpenAI's Whisper "State of the Art" models (★65, updated 2 years ago)
- No description (★37, updated 2 years ago)
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. (★137, updated last year)
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. (★83, updated last year)
- Reproducible experimental protocols for multimedia (audio, video, text) databases (★100, updated 3 months ago)
- A testing repo to share code and thoughts on diarisation (★55, updated last year)
- Listen to any audio stream on your machine and print out the transcribed or translated audio. (★120, updated last year)
- Whisper from OpenAI and diarization with Pyannote (★43, updated last year)
- Synchronize Whisper's timestamps over an existing accurate transcription (★151, updated last year)
- This master's thesis project is based on OpenAI Whisper, with the goal of transcribing interviews (★47, updated 9 months ago)
- Zero-shot Audio Classification using Whisper (★79, updated 2 years ago)
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. (★62, updated last week)
- A PyTorch demo of the paper "Voice Separation with an Unknown Number of Multiple Speakers" using Gradio and the NVIDIA NeMo ASR model. (★36, updated last year)
- Real-time Whisper voice recognition with Vosk model feedback. (★112, updated last year)
- No description (★591, updated last year)
- Speaker change detection using SincNet and an LSTM/Transformer (★51, updated last week)
- ONNX Inference of Pyannote Segmentation (★90, updated 5 months ago)
- Speaker diarization service (★23, updated last month)
- A voice assistant with WhisperAI speech recognition (★30, updated 6 months ago)
- Live transcription with OpenAI Whisper (★50, updated 2 years ago)
- Whisper realtime streaming for long speech-to-text transcription and translation (★116, updated last year)