team-re-verb / RE-VERB
speaker diarization system using an LSTM
☆50, updated 2 years ago
Alternatives and similar repositories for RE-VERB:
Users who are interested in RE-VERB are comparing it to the libraries listed below:
- Speaker diarization Python system based on binary key speaker modelling ☆61, updated 3 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆72, updated 2 years ago
- Paper: https://arxiv.org/abs/1702.02285 ☆63, updated 6 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat… ☆65, updated 4 years ago
- Python framework for Speech and Music Detection using Keras. ☆104, updated 2 years ago
- A lightweight library to compute Diarization Error Rate (DER). ☆59, updated last year
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data ☆96, updated last year
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer" ☆22, updated 2 years ago
- Discriminative Neural Clustering for Speaker Diarisation ☆78, updated 2 years ago
- Python library for handling audio datasets. ☆137, updated last year
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN) ☆61, updated 4 years ago
- Diarization scoring tools. ☆240, updated last year
- Extract frequency, power, width and dissonance of formants from wav files ☆25, updated 2 years ago
- This project is about performing speaker diarization for the Hindi language. ☆49, updated 4 years ago
- Variational Bayes HMM over x-vectors diarization ☆266, updated last year
- End-to-end spoken language identification out of the box. ☆48, updated 4 years ago
- Tool to analyze audio corpora in terms of intonation, intensity, duration and voice quality ☆21, updated 5 years ago
- Various algorithms for voice activity detection ☆22, updated 8 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆109, updated 2 years ago
- ☆39, updated last year
- Python library for audio augmentation ☆83, updated last year
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow ☆129, updated 3 years ago
- Deep Neural Network for Speaker Count Estimation ☆147, updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆101, updated 2 years ago
- Implementation of audio degradation processes ☆101, updated 9 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition. ☆36, updated 5 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver… ☆89, updated 4 years ago
- Python wrapper for the rnnoise library ☆47, updated 2 years ago
- Constrained Permutation Invariant Training, Speech Separation ☆47, updated 4 years ago
- Using speaker embedding for diarization in PyTorch ☆18, updated 4 years ago