JohannesBuchner / spoken-command-recognition
A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition
☆69Updated 7 years ago
Alternatives and similar repositories for spoken-command-recognition:
Users that are interested in spoken-command-recognition are comparing it to the libraries listed below
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆45Updated 4 years ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Updated 5 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆65Updated 4 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆96Updated last year
- Collection of machine learning demos for Automatic Speech Recognition☆55Updated 3 years ago
- Speech Enhancement using Bayesian WaveNet☆96Updated 7 years ago
- ☆38Updated 4 years ago
- End to End Dialect Identification using Convolutional Neural Network☆52Updated 5 years ago
- Multilingual Grapheme to Phoneme☆49Updated 9 years ago
- ☆26Updated 7 years ago
- An advance kaldi wrapper for Pyhton☆38Updated 4 years ago
- Speech recognition on the TIMIT (or any other) dataset☆42Updated 7 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 6 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- Keyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.☆29Updated 7 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 6 years ago
- Adapting your own Language Model for Kaldi☆63Updated 6 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- Some notes on Kaldi☆31Updated 10 years ago
- Share some recent speaker recognition papers and their implementations.☆90Updated 5 years ago
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave☆20Updated last year
- Deep Convolution Text to Speech☆35Updated 7 years ago
- Phoneme Recognition using RecNet☆98Updated 8 years ago