ZaVang / GPT-SoVits
☆18 · Updated 3 months ago
Related projects:
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2 ☆51 · Updated 10 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English ☆69 · Updated 2 months ago
- ☆64 · Updated last year
- ☆13 · Updated 3 months ago
- ☆97 · Updated 2 weeks ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion ☆132 · Updated 11 months ago
- CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone ☆125 · Updated 5 months ago
- Huawei Grad-TTS for Chinese ☆43 · Updated 11 months ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer ☆73 · Updated last year
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers ☆68 · Updated 2 months ago
- Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24). ☆61 · Updated last month
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice ☆58 · Updated last week
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei ☆124 · Updated 10 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆64 · Updated 5 months ago
- CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model ☆177 · Updated 4 months ago
- The demo page of InstructTTS ☆155 · Updated 7 months ago
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023 ☆82 · Updated 11 months ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS ☆43 · Updated 2 months ago
- VC Without Retrain! ☆91 · Updated 4 months ago
- Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491). ☆119 · Updated 2 months ago
- A text-conditional diffusion probabilistic model capable of generating high fidelity audio. ☆118 · Updated 3 months ago
- VITS with phoneme-level prosody modeling based on MaskGIT ☆71 · Updated 3 weeks ago
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr… ☆188 · Updated 2 months ago
- [Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers a… ☆60 · Updated 5 months ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released ☆13 · Updated 2 years ago
- ☆18 · Updated last year
- ☆66 · Updated last year
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability ☆79 · Updated 2 months ago
- All generative models in one for a better TTS model ☆64 · Updated last week
- Unofficial implementation of Megatts2 ☆256 · Updated 5 months ago