aalto-speech / speaker-diarization
Speaker diarization scripts, based on AaltoASR
☆190Updated 6 years ago
Alternatives and similar repositories for speaker-diarization
Users who are interested in speaker-diarization are comparing it to the libraries listed below
- DeepSpeech based forced alignment tool☆238Updated 4 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities☆212Updated 3 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
- GStreamer plugin around Kaldi's online neural network decoder☆184Updated 4 years ago
- Open tools and data for cloudless automatic speech recognition☆446Updated 4 years ago
- Deep neural networks for getting text-independent speaker embedding written in TensorFlow☆311Updated 6 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated last year
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆475Updated 5 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆216Updated 4 months ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 4 years ago
- Speech-to-text based on wav2letter built for transfer learning☆97Updated 2 years ago
- Speaker diarization Python system based on binary key speaker modelling☆60Updated 3 years ago
- Tutorial on Kaldi for Brandeis ASR course☆76Updated 5 years ago
- Python interface for forced audio alignment using HTK and SoX☆341Updated 5 years ago
- A high-level toolkit for speaker recognition, built on top of ALIZE-Core.☆126Updated 6 years ago
- Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).☆279Updated last year
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated 2 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆537Updated 3 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Python☆179Updated 3 years ago
- ASR with PyTorch☆139Updated 6 years ago
- End-2-end speech synthesis with recurrent neural networks☆225Updated last year
- CNN to classify samples of voice recordings into the language that was spoken☆44Updated 6 years ago
- Implementation of state of the art d-vector approach for speaker verification☆127Updated 7 years ago
- Keras Interface for Kaldi ASR☆121Updated 7 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆98Updated 3 years ago
- Adapting your own Language Model for Kaldi☆63Updated 6 years ago
- Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks☆105Updated 7 years ago
- An open-source speech-to-text software written in TensorFlow☆158Updated 2 years ago
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)☆225Updated 5 years ago
- Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks☆446Updated 5 years ago