PranavPutsa1006 / Speaker-Diarization
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
β18Updated last year
Alternatives and similar repositories for Speaker-Diarization:
Users that are interested in Speaker-Diarization are comparing it to the libraries listed below
- Audio processing using deep neural networks. Speaker identification using voice embeddings.β13Updated 2 years ago
- πΉ pyannote + π notebook = pyannotebookβ26Updated last year
- Speaker diarization serviceβ21Updated last month
- Deep Learning model for lexical stress detection in spoken Englishβ29Updated 5 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accentβ75Updated 3 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ98Updated last month
- Speaker change detection using SincNet and an LSTM/Transformerβ48Updated 8 months ago
- This project is about performing Speaker diarization for Hindi Language.β49Updated 4 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation suppβ¦β48Updated last year
- Create an LJSpeech structured voice dataset on wave inputβ26Updated 5 months ago
- Zero-shot Audio Classification using Whisperβ80Updated 2 years ago
- Speaker diarization via transfer learningβ27Updated 5 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.β82Updated last year
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Updated 6 years ago
- asr2kβ49Updated 9 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a lβ¦β22Updated 7 months ago
- A curated list of awesome voice activity detectionβ44Updated 4 months ago
- Tunable pipelinesβ31Updated last month
- Real-time Speech Separation, Noise Suppression & Speaker Recognitionβ18Updated 5 years ago
- Code for AccentDB.β20Updated 3 years ago
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.β50Updated 2 years ago
- A python package for whisper normalizerβ53Updated 3 weeks ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)β27Updated 10 years ago
- 24-hour Automatic Speech Recognitionβ27Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription usingβ¦β28Updated last year
- Feature extractor for DL speech processing.β65Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ110Updated 2 years ago
- β17Updated last year
- Swarah: Indian-English speech dataset collected across the countryβ29Updated last year