A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆64 · Apr 29, 2021 · Updated 4 years ago
Alternatives and similar repositories for CTC-Attention-Mispronunciation
Users who are interested in CTC-Attention-Mispronunciation are comparing it to the repositories listed below.
- End-to-End Mispronunciation Detection via wav2vec2.0 ☆51 · Dec 7, 2021 · Updated 4 years ago
- ☆19 · Jun 28, 2022 · Updated 3 years ago
- ☆25 · Jun 14, 2022 · Updated 3 years ago
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass… ☆35 · Jan 23, 2024 · Updated 2 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment. ☆54 · Nov 17, 2021 · Updated 4 years ago
- Mispronunciation detection code for jingju singing voice ☆20 · Sep 5, 2018 · Updated 7 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y… ☆25 · May 6, 2019 · Updated 6 years ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment". ☆198 · Feb 13, 2023 · Updated 3 years ago
- Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou… ☆15 · May 6, 2025 · Updated 10 months ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp… ☆15 · Jun 6, 2023 · Updated 2 years ago
- Pronunciation Evaluation ☆99 · Jul 20, 2025 · Updated 8 months ago
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 · Jan 17, 2024 · Updated 2 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs ☆32 · Sep 26, 2018 · Updated 7 years ago
- Kaldi-based goodness of pronunciation (GOP) ☆160 · Feb 4, 2021 · Updated 5 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202… ☆27 · May 17, 2023 · Updated 2 years ago
- Transfer learning approach to pronunciation scoring ☆12 · Jan 17, 2024 · Updated 2 years ago
- Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi. ☆236 · Apr 3, 2019 · Updated 6 years ago
- In this repository, I try to combine k2 with speechbrain to decode accurately and quickly. ☆16 · Jun 17, 2022 · Updated 3 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition" ☆17 · Nov 28, 2021 · Updated 4 years ago
- Prototype German Computer-Assisted Pronunciation Training tool for lexical stress errors ☆12 · Oct 28, 2015 · Updated 10 years ago
- Properly handle position-dependent phones in a subword lexicon FST ☆31 · Oct 26, 2020 · Updated 5 years ago
- Goodness of Pronunciation algorithm using PyKaldi ☆18 · Jun 12, 2022 · Updated 3 years ago
- This repository contains all the code used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat… ☆26 · Jun 25, 2019 · Updated 6 years ago
- Meta-learning model agnostic (MAML) implementation for cross-accented ASR ☆45 · Feb 9, 2024 · Updated 2 years ago
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code… ☆10 · Dec 25, 2019 · Updated 6 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training ☆41 · Dec 18, 2020 · Updated 5 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Aug 18, 2023 · Updated 2 years ago
- Transformer based ASR Engine. ☆13 · Aug 23, 2021 · Updated 4 years ago
- Long audio alignment using Kaldi ☆23 · Apr 22, 2021 · Updated 4 years ago
- ☆12 · Jun 10, 2021 · Updated 4 years ago
- Neural network-based forced alignment with bidirectional attention mechanism ☆78 · Jan 17, 2025 · Updated last year
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone ☆35 · Feb 18, 2022 · Updated 4 years ago
- magicspeech competition recipe ☆18 · Jun 29, 2020 · Updated 5 years ago
- A non-native English corpus for pronunciation scoring task ☆171 · Oct 26, 2025 · Updated 5 months ago
- LaTeX Thesis Template for Beijing Language and Culture University ☆18 · Apr 10, 2025 · Updated 11 months ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ☆53 · Dec 6, 2022 · Updated 3 years ago
- SEAME corpus two develop set ☆42 · Dec 5, 2019 · Updated 6 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python. ☆378 · Jun 16, 2023 · Updated 2 years ago
- Wenet speech to text for react native ☆10 · Nov 1, 2022 · Updated 3 years ago