aalto-speech / speaker-diarization
Speaker diarization scripts, based on AaltoASR
☆190 Updated 6 years ago
Alternatives and similar repositories for speaker-diarization:
Users who are interested in speaker-diarization are comparing it to the libraries listed below.
- DeepSpeech-based forced alignment tool ☆237 Updated 4 years ago
- Some simple wrappers around kaldi-asr intended to make using Kaldi's (online) decoders as convenient as possible. ☆170 Updated 4 years ago
- GStreamer plugin around Kaldi's online neural network decoder ☆185 Updated 4 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems ☆205 Updated last month
- A list of publicly available audio data that anyone can download for ASR or other speech activities ☆207 Updated 3 years ago
- Python interface for forced audio alignment using HTK and SoX ☆337 Updated 4 years ago
- Speech recognition software where the neural net is trained with TensorFlow and GMM training and decoding is done in Kaldi ☆170 Updated 8 years ago
- A Python wrapper for Speech Signal Processing Toolkit (SPTK). ☆441 Updated 9 months ago
- A Python library for working with Praat, TextGrids, time-aligned audio transcripts, and audio files. It is primarily used for extracting … ☆326 Updated last year
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data ☆96 Updated last year
- Python functions for reading Kaldi data formats. Useful for rapid prototyping with Python. ☆376 Updated last year
- Tools for Speech Enhancement integrated with Kaldi ☆412 Updated last year
- End-to-end speech synthesis with recurrent neural networks ☆226 Updated last year
- Keras Interface for Kaldi ASR ☆121 Updated 7 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆64 Updated 4 years ago
- ASR with PyTorch ☆139 Updated 6 years ago
- Python speaker diarization system based on binary key speaker modelling ☆61 Updated 3 years ago
- Voice Activity Detector in Python ☆475 Updated 4 years ago
- A Collection of Speech Corpora for ASR and TTS ☆113 Updated 7 years ago
- Tool for creation, manipulation and maintenance of voice corpora ☆81 Updated 11 months ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆72 Updated 2 years ago
- Deep neural networks, written in TensorFlow, for extracting text-independent speaker embeddings ☆309 Updated 6 years ago
- Open tools and data for cloudless automatic speech recognition ☆447 Updated 4 years ago
- Adapting your own Language Model for Kaldi ☆63 Updated 6 years ago
- FastCGI support for Kaldi ASR ☆184 Updated 6 years ago
- Diarization scoring tools. ☆240 Updated 2 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages ☆470 Updated 5 years ago
- Implementation of the state-of-the-art d-vector approach for speaker verification ☆127 Updated 7 years ago
- Utterance-level Aggregation For Speaker Recognition In The Wild ☆368 Updated 2 years ago
- Two-talker Speech Separation with LSTM/BLSTM using the Permutation Invariant Training method. ☆309 Updated 3 years ago