dabinat / deepspeech-tools
Scripts to simplify data prepping for Mozilla DeepSpeech.
☆14Updated 6 years ago
Alternatives and similar repositories for deepspeech-tools
Users who are interested in deepspeech-tools are comparing it to the libraries listed below
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 6 years ago
- An advanced Kaldi wrapper for Python☆38Updated 4 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- An online speech recognition extension toolkit of Kaldi☆56Updated 4 years ago
- Custom decoders for Kaldi☆79Updated 6 years ago
- Tacotron text to speech in C++ (synthesis only)☆76Updated 5 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- ☆48Updated 4 years ago
- Official home of the Idlak Speech Synthesis Toolkit☆66Updated 3 years ago
- Scripts to align a given wave to its transcription using models trained with Kaldi☆32Updated 5 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆176Updated 8 months ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 5 years ago
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- Keyword Search Recipe for Subword ASR☆30Updated 6 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated 2 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆36Updated 3 years ago
- ☆76Updated 3 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 3 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 4 years ago
- Custom decoders for Kaldi☆13Updated 6 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated last year
- ☆37Updated 3 months ago
- Text-to-Speech tutorial at SLTU 2016☆34Updated 9 years ago
- Support tools for punctuation and boundary detection for ASR output.☆57Updated 2 years ago
- Long audio alignment using Kaldi☆23Updated 4 years ago
- Create CMakeLists.txt for Kaldi☆20Updated 5 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Updated 5 months ago