JazminVidal / gop-pykaldi
Goodness of Pronunciation algorithm using PyKaldi
☆18 Updated 3 years ago
Alternatives and similar repositories for gop-pykaldi
Users who are interested in gop-pykaldi are comparing it to the libraries listed below
- ☆25 Updated 3 years ago
- Baseline kaldi script for UA-SPEECH corpus ☆32 Updated last year
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y… ☆25 Updated 6 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 Updated last year
- A handy dataset of noises for ASR ☆22 Updated 6 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition" ☆10 Updated 2 years ago
- ☆13 Updated 4 years ago
- Prosodic Speech Segmentation with Transformers ☆26 Updated last year
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 Updated 5 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram. ☆24 Updated 4 years ago
- Transfer learning approach to pronunciation scoring ☆11 Updated last year
- ☆14 Updated 3 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆33 Updated 2 months ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models ☆14 Updated 3 years ago
- ☆19 Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 Updated 2 years ago
- ☆29 Updated last year
- A simple command line tool to calculate WER for ASR. ☆14 Updated last year
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion" ☆36 Updated 5 years ago
- Speechflow for emotion recognition related information decomposition ☆10 Updated 4 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆10 Updated 7 months ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR. ☆19 Updated 4 years ago
- ☆26 Updated 2 months ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition ☆18 Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆18 Updated last year
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by … ☆39 Updated 2 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》 Using speech enhancement and end-to-end denoising training to improve degrada… ☆24 Updated 3 years ago
- Phoneme segmentation using pre-trained speech models ☆55 Updated 3 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0 ☆49 Updated 4 years ago
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass… ☆34 Updated last year