jimbozhang / speechocean762View external linksLinks
A non-native English corpus for pronunciation scoring task
☆166Oct 26, 2025Updated 3 months ago
Alternatives and similar repositories for speechocean762
Users that are interested in speechocean762 are comparing it to the libraries listed below
Sorting:
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆197Feb 13, 2023Updated 3 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Jan 17, 2024Updated 2 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆52Nov 17, 2021Updated 4 years ago
- ☆25Jun 14, 2022Updated 3 years ago
- This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).☆22Apr 29, 2024Updated last year
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- Pronunciation Evaluation☆98Jul 20, 2025Updated 6 months ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆51Dec 7, 2021Updated 4 years ago
- Kaldi-based goodness of pronunciation (GOP)☆157Feb 4, 2021Updated 5 years ago
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…☆38Apr 29, 2024Updated last year
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25May 6, 2019Updated 6 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Sep 26, 2018Updated 7 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆63Apr 29, 2021Updated 4 years ago
- ☆13Apr 9, 2021Updated 4 years ago
- Transfer learning approach to pronunciation scoring☆11Jan 17, 2024Updated 2 years ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆34Jan 23, 2024Updated 2 years ago
- Spoken Language assessment☆46Nov 17, 2020Updated 5 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring☆29Oct 23, 2023Updated 2 years ago
- Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.☆235Apr 3, 2019Updated 6 years ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆35Feb 5, 2026Updated last week
- ☆20Apr 12, 2025Updated 10 months ago
- Mispronunciation detection code for jingju singing voice☆20Sep 5, 2018Updated 7 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Mar 6, 2023Updated 2 years ago
- ☆19Jun 28, 2022Updated 3 years ago
- Official code for paper:"Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"☆28Jan 28, 2026Updated 2 weeks ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆257May 9, 2022Updated 3 years ago
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆26Mar 13, 2025Updated 11 months ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Jan 20, 2025Updated last year
- ☆12Jun 10, 2021Updated 4 years ago
- ☆38May 13, 2020Updated 5 years ago
- Deep Learning model for lexical stress detection in spoken English☆29Mar 17, 2020Updated 5 years ago
- Deep learning based speech and pronunciation assessment API for 8 languages.☆57Jun 6, 2024Updated last year
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Sep 30, 2022Updated 3 years ago
- ☆27Mar 29, 2021Updated 4 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆55Sep 1, 2025Updated 5 months ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 2 years ago
- List of NN based singal processing papers☆22Jun 5, 2023Updated 2 years ago