team-re-verb / RE-VERBLinks
speaker diarization system using an LSTM
☆50Updated 2 years ago
Alternatives and similar repositories for RE-VERB
Users that are interested in RE-VERB are comparing it to the libraries listed below
Sorting:
- Speaker diarization python system based on binary key speaker modelling☆60Updated 3 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- python wrapper for rnnoise library☆48Updated 2 years ago
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Updated 4 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆36Updated 2 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆107Updated 3 weeks ago
- Speaker diarization scripts, based on AaltoASR☆190Updated 6 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated 2 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆257Updated last year
- DeepSpeech based forced alignment tool☆239Updated 4 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆79Updated 3 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆89Updated last year
- Online streaming speaker change detection model in Pytorch☆42Updated 2 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆141Updated 2 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Updated 5 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆227Updated 4 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 6 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated 2 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 5 years ago
- Deep Convolution Text to Speech☆34Updated 7 years ago
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 3 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆129Updated 4 years ago
- Implementation of audio degradation processes☆103Updated 9 years ago
- Various algorithms for voice activity detection☆22Updated 8 years ago
- Voice Activity Detection (VAD) using deep learning.☆200Updated 5 years ago