sweekarsud / Goodness-of-PronunciationLinks
☆92Updated 2 years ago
Alternatives and similar repositories for Goodness-of-Pronunciation
Users that are interested in Goodness-of-Pronunciation are comparing it to the libraries listed below
Sorting:
- Kaldi-based goodness of pronunciation (GOP)☆151Updated 4 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 6 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆52Updated 3 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated 2 years ago
- Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.☆232Updated 6 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆61Updated 4 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Updated 3 years ago
- Custom decoders for Kaldi☆79Updated 6 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆257Updated 5 years ago
- A pytorch based end2end speech recognition system.☆115Updated 4 years ago
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆77Updated 4 years ago
- An online speech recognition extension toolkit of Kaldi☆56Updated 4 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- ☆61Updated 2 years ago
- Minimize kaldi nnet3 chain decoder☆45Updated 5 years ago
- Yet another speech toolkit based on Kaldi and PyTorch☆174Updated 5 years ago
- 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition☆118Updated 3 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆108Updated 2 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Updated 4 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆46Updated 5 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆38Updated 5 years ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆135Updated last year
- ASR for Chinese Mandarin☆75Updated 7 years ago
- ☆76Updated 3 years ago
- ☆38Updated 5 years ago
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Updated 6 years ago
- A Python toolbox for speech features extraction☆163Updated 2 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆122Updated 5 years ago