Mu-Y / mpl-mddLinks

[Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment

☆29

Alternatives and similar repositories for mpl-mdd

Users that are interested in mpl-mdd are comparing it to the libraries listed below

Sorting:

vocaliodmiku / wav2vec2mdd
End-to-End Mispronunciation Detection via wav2vec2.0
☆47Updated 3 years ago
cageyoko / CTC-Attention-Mispronunciation
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆61Updated 4 years ago
lstrgar / self-supervised-phone-segmentation
Phoneme segmentation using pre-trained speech models
☆55Updated 2 years ago
b04901014 / FG-transformer-TTS
Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.
☆88Updated 3 years ago
KunZhou9646 / seq2seq-EVC
This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…
☆85Updated 2 years ago
desh2608 / diarizer
Clustering-based methods for overlapping diarization
☆81Updated last year
archiki / Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆48Updated 7 months ago
rhss10 / joint-apa-mdd-mtl
Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…
☆21Updated last year
guanlongzhao / ppg-gmm
Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"
☆36Updated 5 years ago
Takaaki-Saeki / zm-text-tts
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆63Updated 2 years ago
spring-media / DeepForcedAligner
☆80Updated last year
prosodylab / prosobeast-annotation-tool
☆40Updated 3 years ago
jindongwang / EasyEspnet
Making Espnet easier to use
☆56Updated 4 years ago
xinjli / alqalign
multilingual speech aligner
☆75Updated last year
Tomiinek / Blizzard2013_Segmentation
Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.
☆44Updated 5 years ago
JazminVidal / gop-dnn-epadb
Goodness of Pronunciation using Kaldi on Epa-DB database
☆35Updated last year
MarceloSancinetti / epa-gop-pykaldi
☆25Updated 3 years ago
vectominist / MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆54Updated 2 years ago
lingjzhu / clap-ipa
Keyword spotting and forced alignment in any language
☆63Updated 3 weeks ago
vocaliodmiku / wav2vec2mdd-Text
☆18Updated 3 years ago
andi611 / CS-Tacotron-Pytorch
Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model.
☆23Updated 6 years ago
ldong1111 / GraphemeBERT
This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models
☆46Updated 3 years ago
Daisyqk / Automatic-Prosody-Annotation
☆111Updated 3 years ago
unilight / seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
☆102Updated last year
bigpon / SpeechSubjectiveTest
Speech (audio) subjective evaluation system
☆40Updated 5 years ago
stefantaubert / mel-cepstral-distance
A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …
☆55Updated 2 months ago
andi611 / Mockingjay-Speech-Representation
Official Implementation of Mockingjay in Pytorch
☆55Updated 2 years ago
vectominist / spin
Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…
☆56Updated 2 years ago
CSTR-Edinburgh / qualtreats
Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.
☆36Updated last year
thuhcsi / NeuFA
Neural network-based forced alignment with bidirectional attention mechanism
☆77Updated 6 months ago