Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decoupled Cross-entropy Loss (NAACL 2025).
☆14May 6, 2025Updated 10 months ago
Alternatives and similar repositories for hmamba
Users that are interested in hmamba are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).☆22Apr 29, 2024Updated last year
- DysfluentWFST☆18Nov 13, 2025Updated 4 months ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…☆38Apr 29, 2024Updated last year
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆64Apr 29, 2021Updated 4 years ago
- ☆21Apr 12, 2025Updated 11 months ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆199Feb 13, 2023Updated 3 years ago
- ☆39Jan 18, 2021Updated 5 years ago
- Viterbi decoding in PyTorch☆41Sep 10, 2025Updated 6 months ago
- ☆10Jun 8, 2022Updated 3 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆51Dec 7, 2021Updated 4 years ago
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆27Mar 13, 2025Updated last year
- The official repo of "WhiStress: Enriching Transcriptions with Sentence Stress Detection" (Interspeech 2025)☆37Jul 24, 2025Updated 7 months ago
- ☆12Feb 3, 2026Updated last month
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆15Jun 6, 2023Updated 2 years ago
- Prototype German Computer-Assisted Pronunciation Training tool for lexical stress errors☆12Oct 28, 2015Updated 10 years ago
- YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection☆20Mar 4, 2025Updated last year
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆37Feb 5, 2026Updated last month
- ☆15Mar 22, 2023Updated 3 years ago
- ☆14Feb 9, 2023Updated 3 years ago
- ☆14Nov 26, 2024Updated last year
- Pronunciation trainer to improve your skills by listening to native speakers☆20Mar 13, 2026Updated last week
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…☆55Nov 4, 2022Updated 3 years ago
- craft ai team scientific activities☆13May 30, 2024Updated last year
- Python API to TalkBankDB.☆12Jan 22, 2024Updated 2 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- LaTeX Thesis Template for Beijing Language and Culture University☆18Apr 10, 2025Updated 11 months ago
- ☆17Jul 23, 2025Updated 8 months ago
- ☆15Apr 3, 2019Updated 6 years ago
- Python script to transform the Mobile Detect JSON database into an UA-based mobile detection VCL subroutine easily integrable in any Varn…☆14Nov 13, 2023Updated 2 years ago
- Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…☆24Dec 8, 2019Updated 6 years ago
- A tutorial diphone synthesizer in Python☆25Nov 26, 2018Updated 7 years ago
- Deep Articulatory Synthesis and Inversion☆55Feb 14, 2024Updated 2 years ago
- Redactle, but in French | Redactle, mais en français☆16Updated this week
- Crowdsourced and Automatic Speech Prominence Estimation☆26Apr 12, 2024Updated last year
- A Sketch plugin that lets you auto-switch light/dark symbols, text and layer styles.☆10Jan 4, 2023Updated 3 years ago
- This small project demonstrates how to integrate WordPress blog entries into queries for a RAG-based (Retriever-Augmented Generation) lan…☆11Apr 2, 2024Updated last year
- The Speech Rate Meter (hereinafter SRM) software module is designed to measure a complex of characteristics of the tempo (rate) of oral s…☆22Jul 11, 2024Updated last year