ai-zahran / E2E-R
Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring
☆20Updated last year
Related projects
Alternatives and complementary repositories for E2E-R
- ☆25Updated 2 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆57Updated 3 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆46Updated 3 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆14Updated 2 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆33Updated 10 months ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆14Updated last year
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- ☆10Updated 2 months ago
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆22Updated 9 months ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 6 years ago
- Went online decode demo☆29Updated 3 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆42Updated 2 years ago
- ☆16Updated 2 years ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆150Updated last year
- ☆86Updated 2 years ago
- [ICASSP2021] Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- Chinese Text Normalization and Dataset☆81Updated 2 years ago
- multilingual speech aligner☆72Updated last year
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆81Updated last year
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…☆28Updated 6 months ago
- Clustering-based methods for overlapping diarization☆70Updated 10 months ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Updated 3 months ago
- Neural network-based forced alignment with bidirectional attention mechanism☆70Updated 2 years ago
- ☆47Updated 2 weeks ago
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…☆26Updated 2 months ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆13Updated 2 years ago
- ☆35Updated 3 months ago
- 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition☆118Updated 2 years ago
- Phoneme segmentation using pre-trained speech models☆54Updated 2 years ago
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances☆48Updated 2 years ago