ai-zahran / E2E-R
Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring
☆26Updated last year
Alternatives and similar repositories for E2E-R:
Users that are interested in E2E-R are comparing it to the libraries listed below
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆20Updated 5 months ago
- ☆25Updated 2 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆50Updated 3 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆15Updated 2 years ago
- ☆17Updated 2 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆59Updated 3 years ago
- ☆13Updated 2 weeks ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆27Updated last year
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 6 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆44Updated 3 years ago
- ☆91Updated 2 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆19Updated last year
- A list of papers for child ASR☆39Updated 6 months ago
- Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model.☆23Updated 6 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- Clustering-based methods for overlapping diarization☆81Updated last year
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23Updated 3 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆47Updated 4 months ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆170Updated 2 years ago
- ☆25Updated 3 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16Updated 3 years ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆33Updated 4 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆17Updated 6 months ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Updated 5 years ago
- Baseline kaldi script for UA-SPEECH corpus☆30Updated 6 months ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆32Updated 2 weeks ago