frank613/CTC-based-GOP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/frank613/CTC-based-GOP)

frank613 / CTC-based-GOP

This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024

☆41

Alternatives and similar repositories for CTC-based-GOP

Users that are interested in CTC-based-GOP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JazminVidal / gop-ft
View on GitHub
Transfer learning approach to pronunciation scoring
☆12Jan 17, 2024Updated 2 years ago
Fuann / hmamba
View on GitHub
Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou…
☆16May 6, 2025Updated last year
yuwchen / MultiPA
View on GitHub
☆21Jun 25, 2026Updated 3 weeks ago
crazycloud / mispronunciation-detection-diagnosis-wav2vec2-and-llm
View on GitHub
Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…
☆59May 6, 2024Updated 2 years ago
YuanGongND / gopt
View on GitHub
Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
☆218Feb 13, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
JazminVidal / gop-pykaldi
View on GitHub
Goodness of Pronunciation algorithm using PyKaldi
☆18Jun 12, 2022Updated 4 years ago
MarceloSancinetti / epa-gop-pykaldi
View on GitHub
☆25Jun 14, 2022Updated 4 years ago
doheejin / HiPAMA
View on GitHub
This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…
☆40Apr 29, 2024Updated 2 years ago
ZhaoZeyu1995 / BenNevis
View on GitHub
A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support
☆12Feb 15, 2026Updated 5 months ago
dayanavivolab / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10Feb 29, 2024Updated 2 years ago
ai-zahran / E2E-R
View on GitHub
Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring
☆29Oct 23, 2023Updated 2 years ago
zelaki / DisfluentFA
View on GitHub
A Weakly Supervised Forced Alignment for disluent speech
☆15Nov 12, 2023Updated 2 years ago
aalto-speech / interspeech2019_karhila_et_al
View on GitHub
Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…
☆25May 6, 2019Updated 7 years ago
rhss10 / joint-apa-mdd-mtl
View on GitHub
Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…
☆25Nov 9, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
JazminVidal / gop-dnn-epadb
View on GitHub
Goodness of Pronunciation using Kaldi on Epa-DB database
☆35Jan 17, 2024Updated 2 years ago
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15Dec 3, 2024Updated last year
tzyll / goparrot
View on GitHub
Goodness of Pronunciation (GOP) for oral reading assessment.
☆55Nov 17, 2021Updated 4 years ago
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
tabahi / contexless-phonemes-CUPE
View on GitHub
pytorch model for contexless-phoneme prediction from speech audio
☆32Oct 30, 2025Updated 8 months ago
NickRuiz / power-asr
View on GitHub
Phonetically-Oriented Word Error Rate
☆36May 4, 2019Updated 7 years ago
tabahi / bournemouth-forced-aligner
View on GitHub
Extract phoneme-level timestamps from speeh audio.
☆155Jun 7, 2026Updated last month
desh2608 / pytorch-tdnn
View on GitHub
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
☆41Dec 18, 2020Updated 5 years ago
yuhangear / wenet-android
View on GitHub
☆13Oct 27, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
rorizzz / YOLO-Stutter
View on GitHub
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
☆21Mar 4, 2025Updated last year
shtoshni / g2p
View on GitHub
Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models
☆15Feb 20, 2019Updated 7 years ago
vocaliodmiku / wav2vec2mdd
View on GitHub
End-to-End Mispronunciation Detection via wav2vec2.0
☆52Dec 7, 2021Updated 4 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
charlesliucn / LanMIT
View on GitHub
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22Jul 12, 2019Updated 7 years ago
kamperh / speech_dtw
View on GitHub
Dynamic time warping (DTW) functions for specifically speech alignment.
☆30May 6, 2024Updated 2 years ago
doheejin / SB_loss_PA
View on GitHub
This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).
☆22Apr 29, 2024Updated 2 years ago
vantezzen / quill-languagetool
View on GitHub
✒️ LanguageTool integration for Quill.js editors
☆17Aug 20, 2024Updated last year
scarletcho / prep4kaldi
View on GitHub
Data preparation code for building Kaldi ASR system
☆14Mar 18, 2017Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
smart-audio / audio_diarization_annotation
View on GitHub
Audio Diarization Annotation tool
☆30Nov 8, 2019Updated 6 years ago
cyhuang-tw / robust-vc
View on GitHub
☆11May 7, 2022Updated 4 years ago
RicherMans / UIT_Mobile
View on GitHub
Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"
☆24Mar 6, 2023Updated 3 years ago
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
sweekarsud / Goodness-of-Pronunciation
View on GitHub
Pronunciation Evaluation
☆101Jul 20, 2025Updated last year
dan-wells / kiss-aligner
View on GitHub
Simple Kaldi recipe for forced alignment
☆11Jul 16, 2023Updated 3 years ago
Berkeley-Speech-Group / DysfluentWFST
View on GitHub
DysfluentWFST
☆19Nov 13, 2025Updated 8 months ago