uiuc-sst / asr24
24-hour Automatic Speech Recognition
☆27 · Updated 3 years ago
Alternatives and similar repositories for asr24:
Users who are interested in asr24 are comparing it to the repositories listed below.
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library. ☆15 · Updated 4 years ago
- A recipe for creating a Speaker Identification system built on Kaldi. ☆15 · Updated 5 years ago
- End to End Dialect Identification using Convolutional Neural Network ☆52 · Updated 5 years ago
- Implementation of an end-to-end ASR algorithm with TensorFlow ☆40 · Updated 6 years ago
- Forced Alignments for Common Voice ☆31 · Updated 4 years ago
- Dialect identification using Siamese network ☆15 · Updated 7 years ago
- ABX and Kaldi experiments on speech corpora made easy ☆31 · Updated 4 months ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model ☆34 · Updated 5 years ago
- Readers that enable reading Kaldi ark files in TensorFlow ☆17 · Updated 6 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai… ☆54 · Updated 5 years ago
- Python implementation of pre-processing for End-to-End speech recognition ☆69 · Updated 7 years ago
- A collection of basic Python modules for spoken natural language processing ☆56 · Updated 5 years ago
- Kaldi-style neural network training in PyTorch for use in place of nnet3 in Kaldi. ☆26 · Updated 6 months ago
- Adapting your own Language Model for Kaldi ☆64 · Updated 6 years ago
- An open-source tool for automatic speech recognition (ASR) quality estimation. ☆23 · Updated 5 years ago
- Hybrid speech synthesiser ☆28 · Updated 6 years ago
- ☆45 · Updated 5 years ago
- A Python package that lets TensorFlow read Kaldi scp/ark files in an elegant way. May Kaldi users be happy to enter the TensorFlow world. ☆40 · Updated 6 years ago
- ☆58 · Updated 5 years ago
- Text-to-Speech tutorial at SLTU 2016 ☆35 · Updated 8 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with PyTorch, while feature extraction, alignments, and dec… ☆38 · Updated 7 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning. ☆22 · Updated 6 years ago
- Tool to analyze audio corpora in terms of intonation, intensity, duration and voice quality ☆21 · Updated 5 years ago
- Multilingual Grapheme to Phoneme ☆49 · Updated 8 years ago
- Interspeech 2019 tutorial materials ☆48 · Updated 5 years ago
- An implementation of Tacotron and Tacotron2 ☆81 · Updated 3 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs ☆32 · Updated 6 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications. ☆80 · Updated 5 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone … ☆41 · Updated 2 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset. ☆25 · Updated 6 years ago