vocaliodmiku/wav2vec2mdd

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/vocaliodmiku/wav2vec2mdd)

vocaliodmiku / wav2vec2mdd

End-to-End Mispronunciation Detection via wav2vec2.0

☆52

Alternatives and similar repositories for wav2vec2mdd

Users that are interested in wav2vec2mdd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cageyoko / CTC-Attention-Mispronunciation
View on GitHub
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆64Apr 29, 2021Updated 5 years ago
Mu-Y / mpl-mdd
View on GitHub
[Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…
☆38Jan 23, 2024Updated 2 years ago
vocaliodmiku / wav2vec2mdd-Text
View on GitHub
☆19Jun 28, 2022Updated 4 years ago
MarceloSancinetti / epa-gop-pykaldi
View on GitHub
☆25Jun 14, 2022Updated 4 years ago
sweekarsud / Goodness-of-Pronunciation
View on GitHub
Pronunciation Evaluation
☆101Jul 20, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
tzyll / goparrot
View on GitHub
Goodness of Pronunciation (GOP) for oral reading assessment.
☆55Nov 17, 2021Updated 4 years ago
jimbozhang / speechocean762
View on GitHub
A non-native English corpus for pronunciation scoring task
☆190Oct 26, 2025Updated 9 months ago
YuanGongND / gopt
View on GitHub
Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
☆218Feb 13, 2023Updated 3 years ago
ronggong / mispronunciation-detection
View on GitHub
Mispronunciation detection code for jingju singing voice
☆19Sep 5, 2018Updated 7 years ago
Fuann / hmamba
View on GitHub
Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou…
☆16May 6, 2025Updated last year
JazminVidal / gop-pykaldi
View on GitHub
Goodness of Pronunciation algorithm using PyKaldi
☆19Jun 12, 2022Updated 4 years ago
crazycloud / mispronunciation-detection-diagnosis-wav2vec2-and-llm
View on GitHub
Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…
☆59May 6, 2024Updated 2 years ago
JazminVidal / gop-dnn-epadb
View on GitHub
Goodness of Pronunciation using Kaldi on Epa-DB database
☆35Jan 17, 2024Updated 2 years ago
rhss10 / joint-apa-mdd-mtl
View on GitHub
Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…
☆25Nov 9, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Sreyan88 / Disfluency-Detection-with-Span-Classification
View on GitHub
This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…
☆14Jun 6, 2023Updated 3 years ago
k2-fsa / kaldifst
View on GitHub
Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files
☆56Apr 9, 2026Updated 3 months ago
MLSpeech / DeepPhoneticToolsTutorial
View on GitHub
Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15
☆12Apr 17, 2017Updated 9 years ago
frank613 / CTC-based-GOP
View on GitHub
This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024
☆41Feb 5, 2026Updated 5 months ago
vakila / de-stress
View on GitHub
Prototype German Computer-Assisted Pronunciation Training tool for lexical stress errors
☆12Oct 28, 2015Updated 10 years ago
moisesveleta / GOP-LSTM
View on GitHub
Improving the Goodness of Pronunciation with DNNs and RNNs
☆32Sep 26, 2018Updated 7 years ago
PyThaiNLP / thai-g2p-wiktionary-corpus
View on GitHub
Thai Grapheme to Phoneme (G2P) Wiktionary Corpus
☆13Jul 25, 2022Updated 4 years ago
aalto-speech / interspeech2019_karhila_et_al
View on GitHub
Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…
☆25May 6, 2019Updated 7 years ago
jimbozhang / kaldi-gop
View on GitHub
Kaldi-based goodness of pronunciation (GOP)
☆161Feb 4, 2021Updated 5 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
desh2608 / pytorch-tdnn
View on GitHub
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
☆41Dec 18, 2020Updated 5 years ago
x-lixu / pyResults
View on GitHub
A tool for calculating WER (Word Error Rate) in python.
☆14Sep 18, 2024Updated last year
zhitko / inton-trainer
View on GitHub
Inton Trainer is designed for learning the intonation of oral speech.
☆13Jun 27, 2026Updated last month
changelinglab / prism
View on GitHub
A toolkit and benchmark for evaluating phonetic capabilities of speech models.
☆18Apr 10, 2026Updated 3 months ago
motazsaad / ara-pronunciation-tool
View on GitHub
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15Sep 5, 2017Updated 8 years ago
Louis0324 / DDSP-Articulatory-Vocoder
View on GitHub
☆29Sep 5, 2024Updated last year
ZhaoZeyu1995 / BenNevis
View on GitHub
A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support
☆12Feb 15, 2026Updated 5 months ago
JawadAr / Pronunciation-verification-using-anomaly-detection-Thesis
View on GitHub
This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat…
☆26Jun 25, 2019Updated 7 years ago
pashanitw / W2V2-BERT-ASR-Training
View on GitHub
☆15Mar 25, 2024Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
matthewmorrone / cmudict-ipa
View on GitHub
CMU dictionary in IPA instead of their subset of Arpabet
☆16Jun 21, 2026Updated last month
dan-wells / kiss-aligner
View on GitHub
Simple Kaldi recipe for forced alignment
☆11Jul 16, 2023Updated 3 years ago
MontrealCorpusTools / MFA-reorganization-scripts
View on GitHub
Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner
☆43Jun 22, 2021Updated 5 years ago
thuhcsi / NeuFA
View on GitHub
Neural network-based forced alignment with bidirectional attention mechanism
☆78Jan 17, 2025Updated last year
HKAB / whisper-finetune-vietnamese
View on GitHub
Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM
☆38Oct 6, 2023Updated 2 years ago
juice500ml / dysarthria-gop
View on GitHub
Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…
☆28Mar 13, 2025Updated last year
kosuke-kitahara / xlsr-wav2vec2-phoneme-recognition
View on GitHub
☆27Mar 29, 2021Updated 5 years ago