rhss10 / joint-apa-mdd-mtlView external linksLinks
Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-task Learning"
☆25Nov 9, 2023Updated 2 years ago
Alternatives and similar repositories for joint-apa-mdd-mtl
Users that are interested in joint-apa-mdd-mtl are comparing it to the libraries listed below
Sorting:
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…☆38Apr 29, 2024Updated last year
- This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).☆22Apr 29, 2024Updated last year
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆26Mar 13, 2025Updated 11 months ago
- ☆20Apr 12, 2025Updated 10 months ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆53Nov 17, 2021Updated 4 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Jan 17, 2024Updated 2 years ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆199Feb 13, 2023Updated 3 years ago
- Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou…☆14May 6, 2025Updated 9 months ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆34Jan 23, 2024Updated 2 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆15Jun 11, 2024Updated last year
- ☆18Jan 18, 2024Updated 2 years ago
- Mispronunciation detection code for jingju singing voice☆20Sep 5, 2018Updated 7 years ago
- Prompting Large Language Models with Audio for General-Purpose Speech Summarization☆19May 14, 2025Updated 9 months ago
- ☆19Jun 28, 2022Updated 3 years ago
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…☆25Jun 23, 2021Updated 4 years ago
- ☆25Jun 14, 2022Updated 3 years ago
- A non-native English corpus for pronunciation scoring task☆166Oct 26, 2025Updated 3 months ago
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring☆29Oct 23, 2023Updated 2 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25May 6, 2019Updated 6 years ago
- ☆23Dec 14, 2021Updated 4 years ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆11Jun 28, 2022Updated 3 years ago
- Deep Learning model for lexical stress detection in spoken English☆29Mar 17, 2020Updated 5 years ago
- ☆27Mar 29, 2021Updated 4 years ago
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆33Sep 29, 2023Updated 2 years ago
- Tally Prime MCP (Model Context Protocol) Server implementation to feed Tally ERP data to popular LLM like Claude, ChatGPT supporting MCP☆15Nov 11, 2025Updated 3 months ago
- This project predicts wind turbine failure using numerous sensor data by applying classification based ML models that improves prediction…☆11Mar 20, 2023Updated 2 years ago
- ☆10Jul 29, 2022Updated 3 years ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min…☆10Updated this week
- Extract information from XBRL files in the ESEF format☆13Jan 3, 2026Updated last month
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Sep 26, 2018Updated 7 years ago
- ImageQA is a tool for analyzing digital image quality according to specific attributes such as color, tone transfer, noise or resolution.…☆10Sep 18, 2024Updated last year
- "SSPNet: An interpretable 3D-CNN for classification of schizophrenia using phase maps of resting-state complex-valued fMRI data," publish…☆10May 13, 2022Updated 3 years ago
- A reddit scraping and analysis bot to visualize linguistic and content trends☆12Oct 5, 2021Updated 4 years ago
- This is a dehazed method for remote sensing image, which based on CycleGAN.☆12May 10, 2022Updated 3 years ago
- generate video with voice narration from ppt/pdf Slides☆10Sep 4, 2023Updated 2 years ago
- This Repo contains a fully functional API ready application for delineating fields for smart farming platform☆15Jan 20, 2023Updated 3 years ago
- Self-Supervised MRI Reconstruction☆10May 25, 2021Updated 4 years ago
- Deepfake faces detection from forged videos where used explainable AI for models' robustness as well as cost sensitive methods for mitiga…☆10May 27, 2024Updated last year