This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Multi-Aspect Attention (ICASSP 2023).
☆38Apr 29, 2024Updated last year
Alternatives and similar repositories for HiPAMA
Users that are interested in HiPAMA are comparing it to the libraries listed below
Sorting:
- This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).☆22Apr 29, 2024Updated last year
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆198Feb 13, 2023Updated 3 years ago
- ☆13Apr 9, 2021Updated 4 years ago
- Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou…☆14May 6, 2025Updated 9 months ago
- Transfer learning approach to pronunciation scoring☆11Jan 17, 2024Updated 2 years ago
- ☆25Jun 14, 2022Updated 3 years ago
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring☆29Oct 23, 2023Updated 2 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆53Nov 17, 2021Updated 4 years ago
- ☆20Apr 12, 2025Updated 10 months ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆36Feb 5, 2026Updated 3 weeks ago
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆27Mar 13, 2025Updated 11 months ago
- A non-native English corpus for pronunciation scoring task☆169Oct 26, 2025Updated 4 months ago
- This repository is the implementation of the ProTACT architecture, introduced in the paper "Prompt- and Trait Relation-aware Cross-prompt…☆23Feb 5, 2025Updated last year
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Jan 17, 2024Updated 2 years ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆14Nov 25, 2024Updated last year
- Speech Assessment API in FastAPI with HuggingFace 🤗☆13May 18, 2025Updated 9 months ago
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- Pronunciation Evaluation☆99Jul 20, 2025Updated 7 months ago
- ASR for dysarthric speakers with Kaldi☆13Jan 14, 2017Updated 9 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆15Jun 11, 2024Updated last year
- Universal multilingual automatic speech transcription into IPA☆77Feb 28, 2025Updated last year
- ☆18Jan 18, 2024Updated 2 years ago
- ☆17Jul 12, 2020Updated 5 years ago
- High-resolution facial landmark detection in artworks☆22Dec 17, 2023Updated 2 years ago
- The official code of our paper at EMNLP 2022: Back to the Future: Bidirectional Information Decoupling Network for Multi-turn Dialogue Mo…☆16Feb 17, 2023Updated 3 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Spoken Language assessment☆46Nov 17, 2020Updated 5 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆27Aug 1, 2023Updated 2 years ago
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆59Jun 3, 2024Updated last year
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25May 6, 2019Updated 6 years ago
- ☆23Dec 14, 2021Updated 4 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Jun 16, 2022Updated 3 years ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆12Jun 28, 2022Updated 3 years ago
- Kaldi-based goodness of pronunciation (GOP)☆158Feb 4, 2021Updated 5 years ago
- Audio Diarization Annotation tool☆30Nov 8, 2019Updated 6 years ago
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.☆25Oct 3, 2022Updated 3 years ago
- Deep Learning model for lexical stress detection in spoken English☆29Mar 17, 2020Updated 5 years ago
- Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.☆33Jan 10, 2022Updated 4 years ago