innnky / MagVITS
VITS with phoneme-level prosody modeling based on MaskGIT
☆71Updated 2 weeks ago
Related projects: ⓘ
- ☆92Updated last month
- ☆38Updated 2 weeks ago
- ☆13Updated 3 months ago
- ☆98Updated 3 weeks ago
- ☆54Updated 11 months ago
- ☆13Updated last year
- ☆65Updated 2 weeks ago
- SOFA: Singing-Oriented Forced Aligner☆118Updated this week
- Music generation☆24Updated 4 months ago
- All generative model in one for better TTS model☆64Updated last week
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆92Updated last week
- ☆37Updated 11 months ago
- ☆27Updated 10 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆79Updated 2 months ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆42Updated 5 months ago
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆124Updated 10 months ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆73Updated last year
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated 10 months ago
- CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone☆125Updated 5 months ago
- ☆24Updated last year
- Pipelines and tools to build your own DiffSinger dataset.☆86Updated 5 months ago
- FlashSpeech: Efficient Zero-Shot Speech Synthesis☆64Updated last month
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆70Updated 2 months ago
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆44Updated last month
- 基于vits fastspeech2 visinger的tts模型☆23Updated last year
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆43Updated 2 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆72Updated this week
- ☆64Updated last year
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English☆69Updated 2 months ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆111Updated 6 months ago