yh1008 / speech-to-text
mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
☆71 · Nov 20, 2017 · Updated 8 years ago
Alternatives and similar repositories for speech-to-text
Users who are interested in speech-to-text are comparing it to the libraries listed below.
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai… ☆55 · Jan 2, 2020 · Updated 6 years ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.… ☆11 · Feb 4, 2020 · Updated 6 years ago
- Kaldi extended by Kaituo XU with new features in nnet1. ☆12 · Dec 16, 2018 · Updated 7 years ago
- An ASR decoder and make-graph project ☆33 · May 26, 2022 · Updated 3 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition ☆14 · Sep 4, 2019 · Updated 6 years ago
- ☆17 · Jun 30, 2020 · Updated 5 years ago
- Keras Interface for Kaldi ASR ☆122 · Sep 27, 2017 · Updated 8 years ago
- In this repository, I try to combine k2 with SpeechBrain to decode well and quickly. ☆16 · Jun 17, 2022 · Updated 3 years ago
- Multilingual and code-switching ASR challenges for low-resource Indian languages. ☆21 · Jul 26, 2021 · Updated 4 years ago
- Wake word spotting with Kaldi ☆19 · Dec 3, 2020 · Updated 5 years ago
- MirasVoice is a dataset consisting of speech samples from bilinguals to train neural networks for optimization of speaker verification algor… ☆19 · Mar 15, 2020 · Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni Hindi ASR challenge ☆15 · Mar 26, 2022 · Updated 3 years ago
- Python implementation of pre-processing for End-to-End speech recognition ☆69 · Feb 19, 2018 · Updated 7 years ago
- A KWS demo on Android ☆40 · May 28, 2024 · Updated last year
- Code for end-to-end ASR with neural networks, built with TensorFlow ☆110 · Jan 24, 2019 · Updated 7 years ago
- Custom decoders for Kaldi ☆13 · Jun 5, 2019 · Updated 6 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs ☆32 · Sep 26, 2018 · Updated 7 years ago
- DaCiDian is an open-source Mandarin Chinese lexicon for automatic speech recognition (ASR) ☆301 · Jun 15, 2020 · Updated 5 years ago
- Baseline Kaldi script for UA-SPEECH corpus ☆32 · Oct 16, 2024 · Updated last year
- Convert words to numbers ☆21 · Apr 13, 2022 · Updated 3 years ago
- A Python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based … ☆16 · Sep 5, 2017 · Updated 8 years ago
- Script for converting Kaldi GMM/HMM models to HTK format ☆11 · Jul 18, 2024 · Updated last year
- ☆13 · Mar 25, 2021 · Updated 4 years ago
- A package for crawling audio from YouTube ☆42 · Aug 8, 2023 · Updated 2 years ago
- Custom decoders for Kaldi ☆80 · Jun 10, 2019 · Updated 6 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper. ☆16 · Jul 22, 2021 · Updated 4 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function. ☆123 · Apr 15, 2020 · Updated 5 years ago
- An advanced Kaldi wrapper for Python ☆38 · Mar 1, 2021 · Updated 4 years ago
- Multistream CNN for Robust Acoustic Modeling ☆40 · Jun 17, 2021 · Updated 4 years ago
- ☆41 · Jun 25, 2018 · Updated 7 years ago
- A code library for training acoustic models ☆27 · May 20, 2020 · Updated 5 years ago
- ☆17 · Nov 25, 2019 · Updated 6 years ago
- The Additive Margin MobileNet1D is a new lightweight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi… ☆30 · Oct 3, 2023 · Updated 2 years ago
- One command to build TLG.fst for WeNet. ☆30 · Oct 11, 2022 · Updated 3 years ago
- Keyword Search Recipe for Subword ASR ☆30 · Jul 12, 2019 · Updated 6 years ago
- Implementation of all-neural speech recognition systems using Keras and TensorFlow ☆145 · Oct 12, 2017 · Updated 8 years ago
- Minimize Kaldi nnet3 chain decoder ☆45 · Jan 10, 2020 · Updated 6 years ago
- Use your data to create a speech recognition system in Kaldi. Fast. ☆65 · Jan 2, 2020 · Updated 6 years ago
- Util code, issues, discussions ☆29 · Aug 31, 2018 · Updated 7 years ago