madhu1995-oss / Pronunciation-and-Fluency-evaluation-using-machne-learning-and-DeepLearning
☆12Updated 3 years ago
Alternatives and similar repositories for Pronunciation-and-Fluency-evaluation-using-machne-learning-and-DeepLearning:
Users that are interested in Pronunciation-and-Fluency-evaluation-using-machne-learning-and-DeepLearning are comparing it to the libraries listed below
- Goodness of Pronunciation algorithm using PyKaldi☆15Updated 2 years ago
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…☆32Updated 9 months ago
- ☆25Updated 2 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆11Updated last year
- Goodness of Pronunciation (GOP) for oral reading assessment.☆47Updated 3 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆34Updated last year
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆24Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- ☆12Updated 5 months ago
- An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning☆36Updated 2 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated 11 months ago
- ☆17Updated last year
- 56 language, 1 model Multilingual ASR☆25Updated 3 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆29Updated 9 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 5 years ago
- Workflow for forced alignment between languages☆17Updated last year
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆12Updated last month
- ☆12Updated 6 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆19Updated 3 months ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆17Updated 2 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- ☆12Updated 3 weeks ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆31Updated last year
- ☆11Updated 3 years ago
- ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…☆11Updated 2 months ago