ronggong / mispronunciation-detection
Mispronunciation detection code for jingju singing voice
☆20Updated 6 years ago
Alternatives and similar repositories for mispronunciation-detection:
Users that are interested in mispronunciation-detection are comparing it to the libraries listed below
- A Kaldi/ESPnet based approach to perform automatic speech recognition on low resource languages☆9Updated 3 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆58Updated 3 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23Updated 3 years ago
- Curriculum Vitae of Quan Wang☆14Updated last month
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 5 years ago
- PyTorch implementation of a self-attentive speaker embedding☆17Updated 5 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- End-to-End Mispronunciation Detection via wav2vec2.0☆43Updated 3 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 5 years ago
- Baseline kaldi script for UA-SPEECH corpus☆29Updated 4 months ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).☆38Updated 5 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated 2 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Updated 5 years ago
- Dialect identification using Siamese network☆15Updated 7 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 6 years ago
- ☆53Updated 4 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- VoxSRC Challenge☆31Updated 5 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆47Updated last month
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆21Updated last year
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020☆22Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Updated 2 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆64Updated 5 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 2 years ago