jimbozhang/speechocean762

A non-native English corpus for pronunciation scoring task

☆190

Alternatives and similar repositories for speechocean762

Users interested in speechocean762 are comparing it to the repositories listed below.

YuanGongND / gopt
Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
☆218 · Feb 13, 2023 · Updated 3 years ago
tzyll / goparrot
Goodness of Pronunciation (GOP) for oral reading assessment.
☆55 · Nov 17, 2021 · Updated 4 years ago
JazminVidal / gop-dnn-epadb
Goodness of Pronunciation using Kaldi on Epa-DB database
☆35 · Jan 17, 2024 · Updated 2 years ago
jimbozhang / kaldi-gop
Kaldi-based goodness of pronunciation (GOP)
☆161 · Feb 4, 2021 · Updated 5 years ago
doheejin / SB_loss_PA
This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).
☆22 · Apr 29, 2024 · Updated 2 years ago
sweekarsud / Goodness-of-Pronunciation
Pronunciation Evaluation
☆101 · Jul 20, 2025 · Updated last year
MarceloSancinetti / epa-gop-pykaldi
☆25 · Jun 14, 2022 · Updated 4 years ago
vocaliodmiku / wav2vec2mdd
End-to-End Mispronunciation Detection via wav2vec2.0
☆52 · Dec 7, 2021 · Updated 4 years ago
JazminVidal / gop-pykaldi
Goodness of Pronunciation algorithm using PyKaldi
☆19 · Jun 12, 2022 · Updated 4 years ago
doheejin / HiPAMA
This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…
☆40 · Apr 29, 2024 · Updated 2 years ago
moisesveleta / GOP-LSTM
Improving the Goodness of Pronunciation with DNNs and RNNs
☆32 · Sep 26, 2018 · Updated 7 years ago
aalto-speech / interspeech2019_karhila_et_al
Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…
☆25 · May 6, 2019 · Updated 7 years ago
Shahabks / Speechat
Spoken Language assessment
☆46 · Nov 17, 2020 · Updated 5 years ago
cageyoko / CTC-Attention-Mispronunciation
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆64 · Apr 29, 2021 · Updated 5 years ago
madhu1995-oss / Pronunciation-and-Fluency-evaluation-using-machne-learning-and-DeepLearning
☆13 · Apr 9, 2021 · Updated 5 years ago
ronggong / mispronunciation-detection
Mispronunciation detection code for jingju singing voice
☆19 · Sep 5, 2018 · Updated 7 years ago
rudder-analytics / Goodness-of-Pronounciation
☆54 · Apr 12, 2024 · Updated 2 years ago
Thiagohgl / ai-pronunciation-trainer
This tool uses AI to evaluate your pronunciation.
☆509 · Aug 16, 2025 · Updated 11 months ago
JazminVidal / gop-ft
Transfer learning approach to pronunciation scoring
☆12 · Jan 17, 2024 · Updated 2 years ago
rhss10 / joint-apa-mdd-mtl
Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…
☆25 · Nov 9, 2023 · Updated 2 years ago
tbright17 / kaldi-dnn-ali-gop
Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi.
☆236 · Apr 3, 2019 · Updated 7 years ago
Mu-Y / mpl-mdd
[Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…
☆38 · Jan 23, 2024 · Updated 2 years ago
ai-zahran / E2E-R
Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring
☆29 · Oct 23, 2023 · Updated 2 years ago
nicolezyh / Automatic-Speech-Scoring-Paper
☆10 · Dec 6, 2019 · Updated 6 years ago
frank613 / CTC-based-GOP
This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024
☆41 · Feb 5, 2026 · Updated 5 months ago
yuwchen / MultiPA
☆21 · Jun 25, 2026 · Updated last month
vocaliodmiku / wav2vec2mdd-Text
☆19 · Jun 28, 2022 · Updated 4 years ago
ASR-project / Multilingual-PR
Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…
☆266 · May 9, 2022 · Updated 4 years ago
RicherMans / UIT_Mobile
Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"
☆24 · Mar 6, 2023 · Updated 3 years ago
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17 · Aug 18, 2023 · Updated 2 years ago
Fuann / hmamba
Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou…
☆16 · May 6, 2025 · Updated last year
knowitall / morpha
Morpha lex stemmer converted using jflex.
☆24 · Oct 12, 2020 · Updated 5 years ago
cadia-lvl / samromur-asr
Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi
☆12 · Sep 30, 2022 · Updated 3 years ago
k2-fsa / kaldifst
Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files
☆56 · Apr 9, 2026 · Updated 3 months ago
EducationalTestingService / rsmtool
A Python package to facilitate research on building and evaluating automated scoring models.
☆71 · Dec 27, 2024 · Updated last year
jimbozhang / xares-llm-template
Template for creating audio encoders compatible with X-ARES
☆19 · Feb 11, 2026 · Updated 5 months ago
OscarVanL / LibriTTS-British-Accents
A subset of the popular LibriTTS dataset with subsets for English, Scottish, Welsh, and Irish accents.
☆16 · Mar 17, 2023 · Updated 3 years ago
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
☆12 · Jun 10, 2021 · Updated 5 years ago
Shahabks / myprosody
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech.
☆275 · Nov 28, 2022 · Updated 3 years ago