DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
☆4,746Mar 19, 2025Updated last year
Alternatives and similar repositories for DiffSinger
Users that are interested in DiffSinger are comparing it to the libraries listed below
Sorting:
- An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singi…☆3,077Feb 7, 2026Updated last month
- Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code☆458Jan 2, 2024Updated 2 years ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆7,840Dec 6, 2023Updated 2 years ago
- Singing Voice Conversion via diffusion model☆2,719Dec 14, 2025Updated 3 months ago
- Neural network-based singing voice synthesis library for research☆742Oct 9, 2023Updated 2 years ago
- An opensource music processing toolkit☆319Jun 25, 2023Updated 2 years ago
- VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer☆353Nov 4, 2024Updated last year
- Singing Voice Synthesis based on VITS, different from VISinger☆196Nov 13, 2023Updated 2 years ago
- PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)☆247Feb 3, 2022Updated 4 years ago
- PyTorch Implementation of Multi-Singer (ACM-MM'21)☆139May 8, 2022Updated 3 years ago
- ☆226Dec 29, 2022Updated 3 years ago
- Official implementation of SawSing (ISMIR'22)☆272Aug 28, 2022Updated 3 years ago
- A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (…☆474Sep 28, 2022Updated 3 years ago
- A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and Diff…☆1,006Apr 2, 2023Updated 2 years ago
- Official PyTorch implementation of BigVGAN (ICLR 2023)☆1,195Sep 5, 2024Updated last year
- SoftVC VITS Singing Voice Conversion☆28,027Nov 11, 2023Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆2,329Jul 27, 2024Updated last year
- Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher☆181Apr 28, 2023Updated 2 years ago
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆2,849Apr 23, 2024Updated last year
- singing voice change based on whisper, and lora for singing voice clone☆649Nov 3, 2023Updated 2 years ago
- PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.☆330Feb 9, 2024Updated 2 years ago
- PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline☆432Apr 19, 2023Updated 2 years ago
- This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.☆602Sep 18, 2023Updated 2 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,637Apr 22, 2024Updated last year
- Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis☆232Dec 10, 2025Updated 3 months ago
- speech self-supervised representations☆519Apr 27, 2023Updated 2 years ago
- Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)☆2,497Feb 22, 2026Updated 3 weeks ago
- DiffSinger community vocoders release page☆304Feb 27, 2025Updated last year
- ☆112Jun 11, 2021Updated 4 years ago
- ☆286Sep 9, 2024Updated last year
- An easy to understand TTS / SVS / SVC framework☆734Mar 2, 2026Updated 2 weeks ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆203Sep 4, 2022Updated 3 years ago
- ☆1,459Feb 11, 2024Updated 2 years ago
- Official implementation of the source-filter HiFiGAN vocoder☆270Jul 29, 2023Updated 2 years ago
- Open singing synthesis platform / Open source UTAU successor☆3,651Mar 12, 2026Updated last week
- State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.☆3,917Jan 4, 2024Updated 2 years ago
- PyTorch Implementation of FastDiff (IJCAI'22)☆415Jun 20, 2024Updated last year
- List of speech synthesis papers.☆1,068Jul 24, 2023Updated 2 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆2,161Oct 27, 2023Updated 2 years ago