smtiitm / Fastspeech2_MFA
Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving quality of synthesis, as well as small foot print TTS integrated with disability aids and various other applications.
☆14Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for Fastspeech2_MFA
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023☆47Updated last year
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆61Updated last week
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆126Updated 8 months ago
- Official Implementation of StyleTTS-VC☆164Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆76Updated last year
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆93Updated 2 weeks ago
- ☆69Updated last year
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆79Updated last year
- The official implementation of EmoSphere++☆27Updated 2 weeks ago
- ☆50Updated 9 months ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆48Updated 3 weeks ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆112Updated 2 years ago
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆199Updated 4 months ago
- ☆73Updated last month
- ☆70Updated last year
- [ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"☆50Updated last week
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated 8 months ago
- ☆62Updated last year
- PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023)☆65Updated 8 months ago
- ☆45Updated last year
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆49Updated this week
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- The official implementation of EmoSphere-TTS☆85Updated 3 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆66Updated last week
- ☆57Updated 2 months ago
- All generative model in one for better TTS model☆66Updated 2 months ago
- Reference-aware automatic speech evaluation toolkit☆109Updated 9 months ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆122Updated 5 months ago
- ☆81Updated 2 months ago
- A sequence-to-sequence voice conversion toolkit.☆86Updated 4 months ago