Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-task Learning"
☆25 · Nov 9, 2023 · Updated 2 years ago
Alternatives and similar repositories for joint-apa-mdd-mtl
Users who are interested in joint-apa-mdd-mtl are comparing it to the repositories listed below
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul… ☆38 · Apr 29, 2024 · Updated last year
- This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023). ☆22 · Apr 29, 2024 · Updated last year
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce… ☆27 · Mar 13, 2025 · Updated 11 months ago
- ☆21 · Apr 12, 2025 · Updated 10 months ago
- Goodness of Pronunciation (GOP) for oral reading assessment. ☆54 · Nov 17, 2021 · Updated 4 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 · Jan 17, 2024 · Updated 2 years ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment". ☆199 · Feb 13, 2023 · Updated 3 years ago
- Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou… ☆14 · May 6, 2025 · Updated 10 months ago
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass… ☆34 · Jan 23, 2024 · Updated 2 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024 ☆15 · Jun 11, 2024 · Updated last year
- ☆18 · Jan 18, 2024 · Updated 2 years ago
- Prompting Large Language Models with Audio for General-Purpose Speech Summarization ☆19 · May 14, 2025 · Updated 9 months ago
- Mispronunciation detection code for jingju singing voice ☆20 · Sep 5, 2018 · Updated 7 years ago
- ☆19 · Jun 28, 2022 · Updated 3 years ago
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u… ☆25 · Jun 23, 2021 · Updated 4 years ago
- ☆25 · Jun 14, 2022 · Updated 3 years ago
- A non-native English corpus for pronunciation scoring task ☆169 · Oct 26, 2025 · Updated 4 months ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y… ☆25 · May 6, 2019 · Updated 6 years ago
- ☆23 · Dec 14, 2021 · Updated 4 years ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings ☆12 · Jun 28, 2022 · Updated 3 years ago
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring ☆29 · Oct 23, 2023 · Updated 2 years ago
- Deep Learning model for lexical stress detection in spoken English ☆29 · Mar 17, 2020 · Updated 5 years ago
- ☆27 · Mar 29, 2021 · Updated 4 years ago
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%) ☆33 · Sep 29, 2023 · Updated 2 years ago
- This project predicts wind turbine failure using numerous sensor data by applying classification based ML models that improves prediction… ☆11 · Mar 20, 2023 · Updated 2 years ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022 ☆11 · Apr 13, 2025 · Updated 10 months ago
- Extract information from XBRL files in the ESEF format ☆13 · Jan 3, 2026 · Updated 2 months ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min… ☆10 · Updated this week
- ☆10 · Jul 29, 2022 · Updated 3 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs ☆32 · Sep 26, 2018 · Updated 7 years ago
- Vector search with Pinecone and OpenAI to search through a contract law textbook. If downloaded, remember to install all dependencies. Refer… ☆13 · Mar 30, 2023 · Updated 2 years ago
- A Reddit scraping and analysis bot to visualize linguistic and content trends ☆11 · Oct 5, 2021 · Updated 4 years ago
- This repository is the legal large language model project of the BUPT Intelligent Systems Lab, implemented on top of open-source large models such as ChatGLM. ☆11 · Nov 28, 2023 · Updated 2 years ago
- Generate video with voice narration from PPT/PDF slides ☆10 · Sep 4, 2023 · Updated 2 years ago
- Deepfake faces detection from forged videos where used explainable AI for models' robustness as well as cost sensitive methods for mitiga… ☆10 · May 27, 2024 · Updated last year
- Self-Supervised MRI Reconstruction ☆10 · May 25, 2021 · Updated 4 years ago
- PyTorch Implementation of the Explainable Conditional Adversarial Autoencoder using Saliency Maps and SHAP (J. of Imaging - MDPI) ☆12 · Mar 5, 2025 · Updated last year
- Automated Question-Answering Over Knowledge Graphs in O&M of Wind Turbines ☆12 · Aug 16, 2022 · Updated 3 years ago
- WindTurbineHighSpeedBearingPrognosis-Data ☆10 · Aug 19, 2020 · Updated 5 years ago