madhu1995-oss / Pronunciation-and-Fluency-evaluation-using-machne-learning-and-DeepLearning
☆13Updated 4 years ago
Alternatives and similar repositories for Pronunciation-and-Fluency-evaluation-using-machne-learning-and-DeepLearning
Users that are interested in Pronunciation-and-Fluency-evaluation-using-machne-learning-and-DeepLearning are comparing it to the libraries listed below
Sorting:
- ☆25Updated 2 years ago
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…☆34Updated last year
- Goodness of Pronunciation algorithm using PyKaldi☆15Updated 2 years ago
- Deep Learning model for lexical stress detection in spoken English☆29Updated 5 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆27Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Reference☆33Updated last year
- ☆13Updated 8 months ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆51Updated 3 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- [ICASSP‘25] Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics☆25Updated last month
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆15Updated 5 months ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆59Updated 4 years ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆20Updated last year
- Workflow for forced alignment between languages☆18Updated last year
- ☆12Updated 3 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆17Updated 2 years ago
- 56 language, 1 model Multilingual ASR☆25Updated 3 years ago
- A handy dataset of noises for ASR☆21Updated 5 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆22Updated last year
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆37Updated 4 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Updated 3 years ago
- Just another FastSpeech 2 but cleaner code :)☆26Updated 10 months ago
- Transfer learning approach to pronunciation scoring☆10Updated last year
- Phoneme alignment representation compatible with multiple forced aligners☆21Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆114Updated 2 years ago
- ☆21Updated 8 months ago
- Collection of scripts from mHuBERT-147.☆24Updated 5 months ago