Fuann/hmamba

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Fuann/hmamba)

Fuann / hmamba

Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decoupled Cross-entropy Loss (NAACL 2025).

☆16

Alternatives and similar repositories for hmamba

Users that are interested in hmamba are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

doheejin / SB_loss_PA
View on GitHub
This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).
☆22Apr 29, 2024Updated 2 years ago
JazminVidal / gop-ft
View on GitHub
Transfer learning approach to pronunciation scoring
☆12Jan 17, 2024Updated 2 years ago
doheejin / HiPAMA
View on GitHub
This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…
☆40Apr 29, 2024Updated 2 years ago
frank613 / CTC-based-GOP
View on GitHub
This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024
☆41Feb 5, 2026Updated 5 months ago
cageyoko / CTC-Attention-Mispronunciation
View on GitHub
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆64Apr 29, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
YuanGongND / gopt
View on GitHub
Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
☆218Feb 13, 2023Updated 3 years ago
amritkromana / disfluency_detection_from_audio
View on GitHub
☆35Aug 22, 2024Updated last year
ZhaoZeyu1995 / BenNevis
View on GitHub
A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support
☆12Feb 15, 2026Updated 5 months ago
skit-ai / slu-prosody
View on GitHub
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27May 17, 2023Updated 3 years ago
vocaliodmiku / wav2vec2mdd
View on GitHub
End-to-End Mispronunciation Detection via wav2vec2.0
☆52Dec 7, 2021Updated 4 years ago
Berkeley-Speech-Group / DysfluentWFST
View on GitHub
DysfluentWFST
☆19Nov 13, 2025Updated 8 months ago
vocaliodmiku / wav2vec2mdd-Text
View on GitHub
☆19Jun 28, 2022Updated 4 years ago
scir-zywang / self-training-self-supervised-disfluency
View on GitHub
☆39Jan 18, 2021Updated 5 years ago
crazycloud / mispronunciation-detection-diagnosis-wav2vec2-and-llm
View on GitHub
Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…
☆59May 6, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
maxrmorrison / torbi
View on GitHub
Viterbi decoding in PyTorch
☆42May 5, 2026Updated 2 months ago
juice500ml / dysarthria-gop
View on GitHub
Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…
☆28Mar 13, 2025Updated last year
ai-zahran / E2E-R
View on GitHub
Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring
☆29Oct 23, 2023Updated 2 years ago
JazminVidal / gop-dnn-epadb
View on GitHub
Goodness of Pronunciation using Kaldi on Epa-DB database
☆35Jan 17, 2024Updated 2 years ago
rossellhayes / ipa
View on GitHub
🗣️ Convert between phonetic alphabets
☆11Feb 7, 2022Updated 4 years ago
dayanavivolab / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10Feb 29, 2024Updated 2 years ago
FlorinAndrei / misc
View on GitHub
a catch-all repo
☆11Dec 28, 2023Updated 2 years ago
Mocahteam / E-LearningScape
View on GitHub
☆12Feb 3, 2026Updated 5 months ago
Sreyan88 / Disfluency-Detection-with-Span-Classification
View on GitHub
This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…
☆14Jun 6, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Auroraaa86 / LCS-CTC
View on GitHub
For IEEE ASRU(2025)
☆15Jun 21, 2025Updated last year
rorizzz / YOLO-Stutter
View on GitHub
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
☆21Mar 4, 2025Updated last year
vakila / de-stress
View on GitHub
Prototype German Computer-Assisted Pronunciation Training tool for lexical stress errors
☆12Oct 28, 2015Updated 10 years ago
MontrealCorpusTools / kalpy
View on GitHub
Pybind11 bindings for Kaldi
☆15Jul 11, 2026Updated last week
BayBenj / english-syllabifier
View on GitHub
Tool for parsing English phonemes into syllables.
☆10Jan 15, 2018Updated 8 years ago
LouChao98 / nner_as_parsing
View on GitHub
☆16Mar 22, 2023Updated 3 years ago
ErikEkstedt / conv_ssl
View on GitHub
☆14Feb 9, 2023Updated 3 years ago
TalkBank / TBDBpy
View on GitHub
Python API to TalkBankDB.
☆13Jan 22, 2024Updated 2 years ago
xiaoxue1117 / speech-mamba-public
View on GitHub
☆15Nov 26, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zelaki / DisfluentFA
View on GitHub
A Weakly Supervised Forced Alignment for disluent speech
☆15Nov 12, 2023Updated 2 years ago
colinator / timit_utils
View on GitHub
Python/numpy/pandas convenience wrapper for the TIMIT database.
☆11Nov 26, 2018Updated 7 years ago
NickRuiz / power-asr
View on GitHub
Phonetically-Oriented Word Error Rate
☆36May 4, 2019Updated 7 years ago
lstrgar / ss-phoneme-seg
View on GitHub
Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…
☆55Nov 4, 2022Updated 3 years ago
craft-ai / sci
View on GitHub
craft ai team scientific activities
☆14May 30, 2024Updated 2 years ago
KDE / artikulate
View on GitHub
Pronunciation trainer to improve your skills by listening to native speakers
☆20Jul 17, 2026Updated last week
JazminVidal / gop-pykaldi
View on GitHub
Goodness of Pronunciation algorithm using PyKaldi
☆18Jun 12, 2022Updated 4 years ago