JohannesBuchner / spoken-command-recognition
A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition
☆68Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for spoken-command-recognition
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 2 years ago
- Share some recent speaker recognition papers and their implementations.☆90Updated 5 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 6 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 4 years ago
- ☆59Updated 4 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆67Updated last year
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 5 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆101Updated 5 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆137Updated 4 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆142Updated last year
- Text Independent Speaker Verification Using GE2E Loss☆83Updated 5 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated last year
- ☆26Updated 7 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 5 years ago
- Tutorial on Kaldi for Brandeis ASR course☆76Updated 4 years ago
- An open-source speech separation and enhancement library☆211Updated 4 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- ☆131Updated 6 years ago
- Neural speaker recognition/verification system based on Kaldi and Tensorflow☆32Updated 4 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 4 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Updated 3 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 5 years ago
- Voice Activity Detector☆72Updated last year
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆106Updated 8 months ago
- Visualization toolbox for Sound Event Detection☆116Updated 8 months ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 2 years ago
- RASTA-PLP and MFCC tool based rasta-mat☆33Updated 2 years ago
- deep clustering method for single-channel speech separation☆109Updated 2 years ago