rhss10/joint-apa-mdd-mtl

Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-task Learning"

☆25

Alternatives and similar repositories for joint-apa-mdd-mtl

Users who are interested in joint-apa-mdd-mtl are comparing it to the repositories listed below.

doheejin / SB_loss_PA
This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).
☆22 · Apr 29, 2024 · Updated 2 years ago
Mu-Y / mpl-mdd
[Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…
☆38 · Jan 23, 2024 · Updated 2 years ago
doheejin / HiPAMA
This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…
☆40 · Apr 29, 2024 · Updated 2 years ago
juice500ml / dysarthria-gop
Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…
☆28 · Mar 13, 2025 · Updated last year
vocaliodmiku / wav2vec2mdd-Text
☆19 · Jun 28, 2022 · Updated 4 years ago
JazminVidal / gop-dnn-epadb
Goodness of Pronunciation using Kaldi on Epa-DB database
☆35 · Jan 17, 2024 · Updated 2 years ago
tzyll / goparrot
Goodness of Pronunciation (GOP) for oral reading assessment.
☆55 · Nov 17, 2021 · Updated 4 years ago
YuanGongND / gopt
Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
☆218 · Feb 13, 2023 · Updated 3 years ago
Fuann / hmamba
Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou…
☆16 · May 6, 2025 · Updated last year
JazminVidal / gop-ft
Transfer learning approach to pronunciation scoring
☆12 · Jan 17, 2024 · Updated 2 years ago
ai-zahran / E2E-R
Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring
☆29 · Oct 23, 2023 · Updated 2 years ago
MarceloSancinetti / epa-gop-pykaldi
☆25 · Jun 14, 2022 · Updated 4 years ago
xiuwenz2 / SAPC-template
☆17 · Mar 20, 2026 · Updated 4 months ago
cageyoko / CTC-Attention-Mispronunciation
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆64 · Apr 29, 2021 · Updated 5 years ago
yuwchen / MultiPA
☆21 · Jun 25, 2026 · Updated last month
frank613 / CTC-based-GOP
This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024
☆41 · Feb 5, 2026 · Updated 5 months ago
kosuke-kitahara / xlsr-wav2vec2-phoneme-recognition
☆27 · Mar 29, 2021 · Updated 5 years ago
vocaliodmiku / wav2vec2mdd
End-to-End Mispronunciation Detection via wav2vec2.0
☆52 · Dec 7, 2021 · Updated 4 years ago
aalto-speech / interspeech2019_karhila_et_al
Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…
☆25 · May 6, 2019 · Updated 7 years ago
crazycloud / mispronunciation-detection-diagnosis-wav2vec2-and-llm
Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…
☆59 · May 6, 2024 · Updated 2 years ago
ronggong / mispronunciation-detection
Mispronunciation detection code for jingju singing voice
☆19 · Sep 5, 2018 · Updated 7 years ago
koudounasalkis / AI4Voice
This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024
☆15 · Jun 11, 2024 · Updated 2 years ago
jcvasquezc / phonet
Keras-based python framework to compute phonological posterior probabilities from audio files
☆48 · Dec 27, 2022 · Updated 3 years ago
liyunlongaaa / AD-TUNING
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11 · Feb 23, 2024 · Updated 2 years ago
jimbozhang / kaldi-gop
Kaldi-based goodness of pronunciation (GOP)
☆161 · Feb 4, 2021 · Updated 5 years ago
vaibhavsundharam / Speech-Emotion-Analysis
Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…
☆25 · Jun 23, 2021 · Updated 5 years ago
ZhaoZeyu1995 / BenNevis
A Differentiable WFST-based End-to-End Automatic Speech Recognition toolkit with flexible topology support
☆12 · Feb 15, 2026 · Updated 5 months ago
sweekarsud / Goodness-of-Pronunciation
Pronunciation Evaluation
☆101 · Jul 20, 2025 · Updated last year
amritkromana / disfluency_detection_from_audio
☆35 · Aug 22, 2024 · Updated last year
WingZLeung / TTDS
Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.
☆13 · Mar 15, 2025 · Updated last year
jimbozhang / speechocean762
A non-native English corpus for pronunciation scoring task
☆190 · Oct 26, 2025 · Updated 9 months ago
nikvaessen / disjoint-mtl
Research code for "Towards multi-task learning of speech and speaker recognition" at https://arxiv.org/pdf/2302.12773.pdf
☆12 · Dec 2, 2024 · Updated last year
JawadAr / Pronunciation-verification-using-anomaly-detection-Thesis
This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat…
☆26 · Jun 25, 2019 · Updated 7 years ago
wonjune-kang / llm-speech-summarization
Prompting Large Language Models with Audio for General-Purpose Speech Summarization
☆20 · May 14, 2025 · Updated last year
razvan404 / multimodal-speech-emotion-recognition
Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp…
☆11 · Jun 19, 2024 · Updated 2 years ago
vakila / de-stress
Prototype German Computer-Assisted Pronunciation Training tool for lexical stress errors
☆12 · Oct 28, 2015 · Updated 10 years ago
usc-sail / trust-ser
Trustworthy Speech Emotion Recognition
☆13 · May 22, 2023 · Updated 3 years ago
dayanavivolab / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10 · Feb 29, 2024 · Updated 2 years ago
spaceraccoon / accent-trainer
Flask webapp/endpoint that compares the user's speech with different accents and assigns similarity scores based on speed, voice (DTW/MFC…
☆18 · Jun 27, 2017 · Updated 9 years ago