juice500ml/dysarthria-gop

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/juice500ml/dysarthria-gop)

juice500ml / dysarthria-gop

Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Uncertainty Quantification" (Interspeech 2023)

☆28

Alternatives and similar repositories for dysarthria-gop

Users that are interested in dysarthria-gop are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

juice500ml / dysarthria-mtl
View on GitHub
Official implementation of the paper "Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task L…
☆12Feb 14, 2024Updated 2 years ago
rhss10 / joint-apa-mdd-mtl
View on GitHub
Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…
☆25Nov 9, 2023Updated 2 years ago
doheejin / SB_loss_PA
View on GitHub
This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).
☆22Apr 29, 2024Updated 2 years ago
WingZLeung / TTDS
View on GitHub
Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.
☆13Mar 15, 2025Updated last year
felixbur / nkululeko
View on GitHub
Machine learning speaker characteristics
☆46Updated this week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
yuwchen / MultiPA
View on GitHub
☆21Jun 25, 2026Updated last month
JazminVidal / gop-ft
View on GitHub
Transfer learning approach to pronunciation scoring
☆12Jan 17, 2024Updated 2 years ago
acetylSv / non-parallel-rhythm-flexible-VC
View on GitHub
PyTorch implementation of: Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences
☆11Jul 18, 2019Updated 7 years ago
skit-ai / slu-prosody
View on GitHub
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27May 17, 2023Updated 3 years ago
ctaguchi / multipa
View on GitHub
Universal multilingual automatic speech transcription into IPA
☆80Feb 28, 2025Updated last year
vocaliodmiku / wav2vec2mdd-Text
View on GitHub
☆19Jun 28, 2022Updated 4 years ago
ronggong / mispronunciation-detection
View on GitHub
Mispronunciation detection code for jingju singing voice
☆19Sep 5, 2018Updated 7 years ago
aixplain / NoRefER
View on GitHub
☆18Jun 5, 2026Updated last month
JazminVidal / gop-dnn-epadb
View on GitHub
Goodness of Pronunciation using Kaldi on Epa-DB database
☆35Jan 17, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
pacscilab / voxangeles
View on GitHub
VoxAngeles Corpus
☆15Aug 23, 2025Updated 11 months ago
xjuspeech / YOLOPitch
View on GitHub
☆10Jun 11, 2024Updated 2 years ago
vaibhavsundharam / Speech-Emotion-Analysis
View on GitHub
Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…
☆25Jun 23, 2021Updated 5 years ago
Fuann / hmamba
View on GitHub
Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou…
☆16May 6, 2025Updated last year
juice500ml / xlm_to_xlsr
View on GitHub
Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)
☆12Mar 12, 2024Updated 2 years ago
YuanGongND / gopt
View on GitHub
Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
☆218Feb 13, 2023Updated 3 years ago
slp-rl / WhiStress
View on GitHub
The official repo of "WhiStress: Enriching Transcriptions with Sentence Stress Detection" (Interspeech 2025)
☆39Jul 24, 2025Updated last year
pengzhendong / streaming-asr
View on GitHub
One command to start a streaming ASR server.
☆12Oct 2, 2024Updated last year
mushanshanshan / ESLTTS
View on GitHub
ESLTTS dataset
☆16Feb 6, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
numediart / LaughterSynthesis
View on GitHub
This repository contains laughter-related synthesis systems.
☆13Nov 7, 2020Updated 5 years ago
ZhaoZeyu1995 / BenNevis
View on GitHub
A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support
☆12Feb 15, 2026Updated 5 months ago
VoiceBank-NTPU-TW / VoiceBank-2023
View on GitHub
VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.
☆40Jan 4, 2026Updated 6 months ago
NiliRahmani / Alzheimer-s-Dementia-Recognition-through-Spontaneous-Speech
View on GitHub
Alzheimer's Dementia Recognition through Spontaneous Speech The ADReSSo Challenge
☆15Aug 6, 2023Updated 2 years ago
BUTSpeechFIT / hystoc
View on GitHub
Getting confidences from any end-to-end systems
☆11May 24, 2023Updated 3 years ago
cristinae / ASRdys
View on GitHub
ASR for dysarthric speakers with Kaldi
☆13Jan 14, 2017Updated 9 years ago
lstrgar / ss-phoneme-seg
View on GitHub
Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…
☆55Nov 4, 2022Updated 3 years ago
huutuongtu / Lightvoc
View on GitHub
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18May 17, 2024Updated 2 years ago
cadia-lvl / samromur-asr
View on GitHub
Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi
☆12Sep 30, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
koudounasalkis / AI4Voice
View on GitHub
This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024
☆15Jun 11, 2024Updated 2 years ago
suralmasha / RuTranscript
View on GitHub
Russian phonetical transcription
☆11May 20, 2026Updated 2 months ago
haoheliu / ontology-aware-audio-tagging
View on GitHub
☆14Nov 22, 2022Updated 3 years ago
backspacetg / distilXLSR
View on GitHub
Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13Mar 30, 2025Updated last year
sunprinceS / MetaASR-CrossAccent
View on GitHub
Meta-Learning for End-to-End ASR
☆10Aug 8, 2020Updated 5 years ago
Koziev / StressModel
View on GitHub
Neural model for prediction of stress position in Russian words
☆13Jun 22, 2025Updated last year
articulatory / articulatory
View on GitHub
Deep Articulatory Synthesis and Inversion
☆57Feb 14, 2024Updated 2 years ago