aalto-speech / speaker-diarizationLinks
Speaker diarization scripts, based on AaltoASR
☆190Updated 6 years ago
Alternatives and similar repositories for speaker-diarization
Users that are interested in speaker-diarization are comparing it to the libraries listed below
Sorting:
- A list of publically available audio data that anyone can download for ASR or other speech activities☆209Updated 3 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆213Updated 4 months ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated last year
- GStreamer plugin around Kaldi's online neural network decoder☆185Updated 4 years ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆376Updated 2 years ago
- Python interface for forced audio alignment using HTK and SoX☆341Updated 4 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆472Updated 5 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆536Updated 3 years ago
- Keras Interface for Kaldi ASR☆121Updated 7 years ago
- Speaker diarization python system based on binary key speaker modelling☆60Updated 3 years ago
- A python wrapper for Speech Signal Processing Toolkit (SPTK).☆442Updated 11 months ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated last year
- A pure python module for reading and writing kaldi ark files☆259Updated 3 months ago
- Speech recognition software where the neural net is trained with TensorFlow and GMM training and decoding is done in Kaldi☆170Updated 8 years ago
- Diarization scoring tools.☆246Updated 2 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆586Updated 3 years ago
- Tools for Speech Enhancement integrated with Kaldi☆415Updated last year
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆379Updated 2 years ago
- Yet another speech toolkit based on Kaldi and PyTorch☆174Updated 4 years ago
- Implementation of state of the art d-vector approach for speaker verification☆127Updated 7 years ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆445Updated 5 years ago
- End-2-end speech synthesis with recurrent neural networks☆226Updated last year
- Deep neural networks for getting text-independent speaker embedding written in TensorFlow☆310Updated 6 years ago
- Speech-to-text based on wav2letter built for transfer learning☆97Updated 2 years ago
- Variational Bayes HMM over x-vectors diarization☆269Updated last year
- ASR with PyTorch☆139Updated 6 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆316Updated 4 years ago