aalto-speech / speaker-diarization
Speaker diarization scripts, based on AaltoASR
☆190Updated 5 years ago
Related projects: ⓘ
- DeepSpeech based forced alignment tool☆232Updated 3 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- GStreamer plugin around Kaldi's online neural network decoder☆185Updated 3 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆184Updated last year
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated last year
- End-2-end speech synthesis with recurrent neural networks☆225Updated 6 months ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆198Updated 3 years ago
- Open tools and data for cloudless automatic speech recognition☆443Updated 3 years ago
- Speech-to-text based on wav2letter built for transfer learning☆95Updated last year
- Tool for creation, manipulation and maintenance of voice corpora☆80Updated 4 months ago
- A high-level toolkit for speaker recognition, build on top of ALIZE-Core.☆125Updated 5 years ago
- Speech recognition software where the neural net is trained with TensorFlow and GMM training and decoding is done in Kaldi☆170Updated 7 years ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆422Updated 4 years ago
- Implementation of state of the art d-vector approach for speaker verification☆127Updated 6 years ago
- ASR with PyTorch☆140Updated 5 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆532Updated 2 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆376Updated last year
- Adapting your own Language Model for Kaldi☆64Updated 5 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆345Updated this week
- A Collection of Speech Corpus for ASR and TTS☆112Updated 7 years ago
- FastCGI support for Kaldi ASR☆184Updated 5 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆576Updated 2 years ago
- A python wrapper for Speech Signal Processing Toolkit (SPTK).☆437Updated 2 months ago
- deep learning based speech enhancement using keras or pytorch, make it easy to use☆333Updated 4 years ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 2 years ago
- Python interface for forced audio alignment using HTK and SoX☆331Updated 4 years ago
- Tools for Speech Enhancement integrated with Kaldi☆394Updated last year
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆464Updated 3 years ago
- A pure python module for reading and writing kaldi ark files☆248Updated last year
- Diarization scoring tools.☆213Updated last year