JohannesBuchner / spoken-command-recognition
A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition
☆69Updated 7 years ago
Alternatives and similar repositories for spoken-command-recognition
Users who are interested in spoken-command-recognition are comparing it to the repositories listed below
- A Collection of Speech Corpus for ASR and TTS☆114Updated 8 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 5 years ago
- HTK features in Python☆73Updated 6 years ago
- ☆26Updated 8 years ago
- Adapting your own Language Model for Kaldi☆63Updated 6 years ago
- Deep Neural Network for Speaker Count Estimation☆156Updated 5 years ago
- End to End Dialect Identification using Convolutional Neural Network☆53Updated 5 years ago
- Speech Enhancement using Bayesian WaveNet☆97Updated 7 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 6 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Updated 4 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 7 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 7 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆98Updated 3 years ago
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 7 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 8 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities☆227Updated 4 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Updated 6 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- Some notes on Kaldi☆31Updated 10 years ago
- Collection of machine learning demos for Automatic Speech Recognition☆55Updated 4 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆82Updated last year
- scripts to align a given wave to its transcription using trained models by Kaldi☆34Updated 6 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Updated 6 years ago
- An implementation of Tacotron and Tacotron2☆80Updated 4 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 5 years ago
- Implementation of an end-to-end ASR algorithm with TensorFlow☆40Updated 7 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46Updated 8 years ago