Grace9994 / CoMoSVC
CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone
☆132Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for CoMoSVC
- CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆183Updated 6 months ago
- Train the next generation of TTS systems.☆161Updated 2 months ago
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆199Updated 4 months ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆134Updated last year
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆232Updated 8 months ago
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice☆145Updated last month
- Singing Voice Synthesis based on VITS, different from VISinger☆187Updated last year
- Training code for FAcodec presented in NaturalSpeech3☆179Updated 2 months ago
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆134Updated last year
- Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for V…☆198Updated 3 months ago
- ☆101Updated last month
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆167Updated 7 months ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆135Updated 6 months ago
- ☆222Updated 9 months ago
- HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform☆136Updated last year
- ☆140Updated 10 months ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆112Updated 2 years ago
- TransferTTS (Zero-Shot learning of VITS)☆90Updated 2 years ago
- VITS with phoneme-level prosody modeling based on MaskGIT☆75Updated 2 months ago
- Official Implementation of StyleTTS-VC☆164Updated last year
- ☆83Updated 2 months ago
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions☆86Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆76Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆135Updated last month
- ☆65Updated last year
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆126Updated 8 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆66Updated last year
- SOFA: Singing-Oriented Forced Aligner☆138Updated last week
- Easy-to-Use Speech MOS predictors☆231Updated last year
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆98Updated 3 weeks ago