Mu-Y / mpl-mdd
[Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment
☆34 Updated last year
Alternatives and similar repositories for mpl-mdd
Users who are interested in mpl-mdd are comparing it to the repositories listed below.
- End-to-End Mispronunciation Detection via wav2vec2.0 ☆49 Updated 4 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques ☆63 Updated 4 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis. ☆89 Updated 3 years ago
- Phoneme segmentation using pre-trained speech models ☆55 Updated 3 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se… ☆87 Updated 3 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ☆52 Updated 3 years ago
- Clustering-based methods for overlapping diarization ☆82 Updated last year
- ☆80 Updated 4 months ago
- A sequence-to-sequence voice conversion toolkit. ☆106 Updated last year
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data" ☆137 Updated 2 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK ☆61 Updated 4 years ago
- A PyTorch implementation of the universal neural vocoder ☆67 Updated 5 years ago
- Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model. ☆23 Updated 6 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 Updated last year
- ☆111 Updated 3 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation. ☆87 Updated 3 years ago
- Official implementation of SpeechSplit2 ☆133 Updated 3 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models" ☆47 Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆63 Updated 2 years ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters! ☆43 Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20… ☆49 Updated last year
- **Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec… ☆102 Updated 8 months ago
- multilingual speech aligner ☆77 Updated 2 years ago
- An implementation of the GlowTTS model. Several modes are added: speaker embedding, prosody encoder (GST), and gradient reversal. ☆54 Updated 3 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin… ☆62 Updated 2 years ago
- Alignment files of LibriTTS. ☆66 Updated 5 years ago
- Speech (audio) subjective evaluation system ☆42 Updated 5 years ago
- ☆121 Updated 3 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ☆122 Updated 3 years ago
- Official Implementation of Mockingjay in Pytorch ☆55 Updated 2 years ago