GalaxieT / MFA-mandarin-pinyin-dict-for-pretrained-mfa-model-v2.0
A dictionary that lets Montreal Forced Aligner (MFA) users align Mandarin data labeled in pinyin using the MFA pretrained model v2.0.
☆13 Updated 4 months ago
Alternatives and similar repositories for MFA-mandarin-pinyin-dict-for-pretrained-mfa-model-v2.0:
Users who are interested in MFA-mandarin-pinyin-dict-for-pretrained-mfa-model-v2.0 are comparing it to the repositories listed below:
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth… ☆84 Updated 2 years ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆85 Updated last year
- Official Implementation of Multi-Stage Multi-Codebook (MSMC) TTS ☆163 Updated 11 months ago
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS ☆64 Updated 4 months ago
- Unofficial PyTorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (… ☆61 Updated 11 months ago
- This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion". ☆88 Updated 3 years ago
- A 16 kHz implementation of HiFi-GAN for soft-vc. ☆98 Updated last year
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022) ☆116 Updated last year
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ☆120 Updated 2 years ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations ☆139 Updated 11 months ago
- PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis ☆72 Updated 3 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm" ☆132 Updated last year
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2 ☆51 Updated last year
- Predict prosody labels for Chinese sentences. ☆41 Updated 2 years ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer ☆75 Updated last year
- The official source code of UniAudio ☆91 Updated last year
- Official implementation of SpeechSplit2 ☆132 Updated 2 years ago
- ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis ☆132 Updated 6 months ago