crazycloud/mispronunciation-detection-diagnosis-wav2vec2-and-llm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/crazycloud/mispronunciation-detection-diagnosis-wav2vec2-and-llm)

crazycloud / mispronunciation-detection-diagnosis-wav2vec2-and-llm

Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large language model

☆59

Alternatives and similar repositories for mispronunciation-detection-diagnosis-wav2vec2-and-llm

Users that are interested in mispronunciation-detection-diagnosis-wav2vec2-and-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JawadAr / Pronunciation-verification-using-anomaly-detection-Thesis
View on GitHub
This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat…
☆26Jun 25, 2019Updated 7 years ago
vocaliodmiku / wav2vec2mdd
View on GitHub
End-to-End Mispronunciation Detection via wav2vec2.0
☆52Dec 7, 2021Updated 4 years ago
YuanGongND / gopt
View on GitHub
Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
☆218Feb 13, 2023Updated 3 years ago
Mu-Y / mpl-mdd
View on GitHub
[Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…
☆38Jan 23, 2024Updated 2 years ago
dayanavivolab / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10Feb 29, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
rhss10 / joint-apa-mdd-mtl
View on GitHub
Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…
☆25Nov 9, 2023Updated 2 years ago
Fuann / hmamba
View on GitHub
Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou…
☆16May 6, 2025Updated last year
cageyoko / CTC-Attention-Mispronunciation
View on GitHub
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆64Apr 29, 2021Updated 5 years ago
vocaliodmiku / wav2vec2mdd-Text
View on GitHub
☆19Jun 28, 2022Updated 4 years ago
rudder-analytics / Goodness-of-Pronounciation
View on GitHub
☆54Apr 12, 2024Updated 2 years ago
kosuke-kitahara / xlsr-wav2vec2-phoneme-recognition
View on GitHub
☆27Mar 29, 2021Updated 5 years ago
MarceloSancinetti / epa-gop-pykaldi
View on GitHub
☆25Jun 14, 2022Updated 4 years ago
anas-rz / specmix-pytorch
View on GitHub
A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features
☆10Oct 5, 2022Updated 3 years ago
KoelLabs / ML
View on GitHub
Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner…
☆25Jul 13, 2026Updated 2 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dhimasryan / STOI-Net
View on GitHub
☆29Nov 7, 2023Updated 2 years ago
tzyll / goparrot
View on GitHub
Goodness of Pronunciation (GOP) for oral reading assessment.
☆55Nov 17, 2021Updated 4 years ago
JazminVidal / gop-dnn-epadb
View on GitHub
Goodness of Pronunciation using Kaldi on Epa-DB database
☆35Jan 17, 2024Updated 2 years ago
lapic-ufjf / searchat-behavior
View on GitHub
A standardized framework for capturing authentic human behavior in search and AI-chat experiments.
☆21Updated this week
Thiagohgl / ai-pronunciation-trainer
View on GitHub
This tool uses AI to evaluate your pronunciation.
☆509Aug 16, 2025Updated 11 months ago
JazminVidal / gop-pykaldi
View on GitHub
Goodness of Pronunciation algorithm using PyKaldi
☆19Jun 12, 2022Updated 4 years ago
sweekarsud / Goodness-of-Pronunciation
View on GitHub
Pronunciation Evaluation
☆101Jul 20, 2025Updated last year
desh2608 / pytorch-tdnn
View on GitHub
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
☆41Dec 18, 2020Updated 5 years ago
skit-ai / Map-Mix
View on GitHub
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…
☆18Feb 17, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gbegus / articulationGAN
View on GitHub
☆24Sep 1, 2023Updated 2 years ago
FlorinAndrei / misc
View on GitHub
a catch-all repo
☆11Dec 28, 2023Updated 2 years ago
thuhcsi / NeuFA
View on GitHub
Neural network-based forced alignment with bidirectional attention mechanism
☆78Jan 17, 2025Updated last year
eduardorochasoares / easytopic
View on GitHub
A pipeline architecture for temporal segmentation of video lectures.
☆12Sep 8, 2020Updated 5 years ago
AmirAbaskohi / Automatic-Speech-recognition-for-Speech-Assessment-of-Persian-Preschool-Children
View on GitHub
Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The …
☆21May 24, 2023Updated 3 years ago
GeWanying / shap-anti-spoofing
View on GitHub
This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…
☆12Jan 24, 2024Updated 2 years ago
yudaleng / COPD-Early-Prediction
View on GitHub
Code repository for the paper “Deep Learning for Detecting and Early Predicting Chronic Obstructive Pulmonary Disease from Spirogram Time…
☆16Jul 20, 2025Updated last year
HazyResearch / anchor-stability
View on GitHub
A study of the downstream instability of word embeddings
☆12Aug 23, 2022Updated 3 years ago
chaufanglin / Normal2Whisper
View on GitHub
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆14Oct 31, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
shreyas253 / SylNet
View on GitHub
SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech
☆27May 25, 2023Updated 3 years ago
MontrealCorpusTools / kalpy
View on GitHub
Pybind11 bindings for Kaldi
☆15Jul 11, 2026Updated 2 weeks ago
neonbjb / tts-scores
View on GitHub
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
☆175Dec 18, 2023Updated 2 years ago
uasolo / FDA-DH
View on GitHub
R Code recipes for Functional Data Analysis for phonetic analysis.
☆13Jul 31, 2024Updated last year
aalto-speech / interspeech2019_karhila_et_al
View on GitHub
Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…
☆25May 6, 2019Updated 7 years ago
JazminVidal / gop-ft
View on GitHub
Transfer learning approach to pronunciation scoring
☆12Jan 17, 2024Updated 2 years ago
Nyralei / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆12Aug 1, 2025Updated 11 months ago