yuwchen / MultiPA
☆10Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for MultiPA
- Goodness of Pronunciation algorithm using PyKaldi☆14Updated 2 years ago
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring☆20Updated last year
- ☆25Updated 2 years ago
- ☆11Updated 3 months ago
- Official Code for ParrotTTS☆43Updated last month
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- Goodness of Pronunciation (GOP) for oral reading assessment.☆46Updated 3 years ago
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…☆28Updated 6 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆28Updated 6 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆71Updated 7 months ago
- Update ASR paper everyday☆54Updated this week
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆22Updated 10 months ago
- Huawei Grad-TTS for Chinese☆45Updated last year
- The official implementation of EmoSphere++☆41Updated 2 weeks ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆26Updated 7 months ago
- 56 language, 1 model Multilingual ASR☆24Updated 3 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆56Updated last year
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆74Updated last year
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆14Updated last year
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆14Updated this week
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆81Updated last year
- ☆47Updated 3 weeks ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆87Updated 2 years ago
- ☆65Updated last year
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆81Updated this week
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 5 months ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆35Updated 2 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆46Updated last year
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆44Updated last year
- Speech samples and code of BEdit-TTS☆32Updated last year