[Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment
☆34Jan 23, 2024Updated 2 years ago
Alternatives and similar repositories for mpl-mdd
Users that are interested in mpl-mdd are comparing it to the libraries listed below
Sorting:
- End-to-End Mispronunciation Detection via wav2vec2.0☆51Dec 7, 2021Updated 4 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆64Apr 29, 2021Updated 4 years ago
- ☆14Aug 19, 2024Updated last year
- ☆20Apr 12, 2025Updated 10 months ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆49May 6, 2024Updated last year
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring☆29Oct 23, 2023Updated 2 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆35Updated this week
- ☆27Mar 29, 2021Updated 4 years ago
- Multilingual-Speech-Synthesis-Voice-Conversion Using Bark + RVC☆14Apr 19, 2025Updated 10 months ago
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆23Nov 14, 2024Updated last year
- ☆14Jul 24, 2025Updated 7 months ago
- A non-native English corpus for pronunciation scoring task☆169Oct 26, 2025Updated 4 months ago
- A family of efficient speech models for multilingual phone recognition☆45Feb 12, 2026Updated 2 weeks ago
- ☆19Jun 28, 2022Updated 3 years ago
- This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).☆22Apr 29, 2024Updated last year
- ☆25Jul 10, 2023Updated 2 years ago
- Workflow for forced alignment between languages☆23Jan 13, 2026Updated last month
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆27May 25, 2023Updated 2 years ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆12Jun 28, 2022Updated 3 years ago
- Python library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings acc…☆24Jan 31, 2025Updated last year
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆198Feb 13, 2023Updated 3 years ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆81Oct 3, 2024Updated last year
- ☆82Jan 22, 2025Updated last year
- ☆10Jul 29, 2022Updated 3 years ago
- Extract information from XBRL files in the ESEF format☆13Jan 3, 2026Updated 2 months ago
- This project predicts wind turbine failure using numerous sensor data by applying classification based ML models that improves prediction…☆11Mar 20, 2023Updated 2 years ago
- 파파고 비공식 번역 자동화 도구 (Unofficial Papago API using reverse-engineered web endpoints)☆10Jul 4, 2025Updated 8 months ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min…☆10Updated this week
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…☆38Apr 29, 2024Updated last year
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆36Sep 21, 2022Updated 3 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Jan 17, 2024Updated 2 years ago
- This Repo contains a fully functional API ready application for delineating fields for smart farming platform☆15Jan 20, 2023Updated 3 years ago
- 该仓库是 BUPT 智能系统实验室的法律大模型项目,基于 ChatGLM 等开源大模型进行实现。☆11Nov 28, 2023Updated 2 years ago
- EEG-based Major Depression Disorder Recognition using Swin Transformers☆10Jun 23, 2024Updated last year
- Documentation and code for predictive maintenance data and assess scripts.☆11Jun 8, 2023Updated 2 years ago
- ☆11Aug 11, 2023Updated 2 years ago
- Pytorch Implementation of the Explainable Conditional Adversarial Autoencoder using Saliency Maps and SHAP (J. of Imaging - MDPI)☆12Mar 5, 2025Updated 11 months ago