MoonInTheRiver/DiffSinger

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MoonInTheRiver/DiffSinger)

MoonInTheRiver / DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

☆4,828

Alternatives and similar repositories for DiffSinger

Users that are interested in DiffSinger are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

openvpi / DiffSinger
View on GitHub
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singi…
☆3,175Jun 28, 2026Updated 3 weeks ago
MoonInTheRiver / NeuralSVB
View on GitHub
Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code
☆461Jan 2, 2024Updated 2 years ago
jaywalnut310 / vits
View on GitHub
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
☆7,883Dec 6, 2023Updated 2 years ago
prophesier / diff-svc
View on GitHub
Singing Voice Conversion via diffusion model
☆2,713Jun 6, 2026Updated last month
nnsvs / nnsvs
View on GitHub
Neural network-based singing voice synthesis library for research
☆746Oct 9, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
SJTMusicTeam / Muskits
View on GitHub
An opensource music processing toolkit
☆320Jun 25, 2023Updated 3 years ago
zhangyongmao / VISinger2
View on GitHub
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
☆355Nov 4, 2024Updated last year
PlayVoice / VI-SVS
View on GitHub
Singing Voice Synthesis based on VITS, different from VISinger
☆198Nov 13, 2023Updated 2 years ago
keonlee9420 / DiffSinger
View on GitHub
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
☆248Feb 3, 2022Updated 4 years ago
Rongjiehuang / Multi-Singer
View on GitHub
PyTorch Implementation of Multi-Singer (ACM-MM'21)
☆139May 8, 2022Updated 4 years ago
YatingMusic / ddsp-singing-vocoders
View on GitHub
Official implementation of SawSing (ISMIR'22)
☆275Aug 28, 2022Updated 3 years ago
M4Singer / M4Singer
View on GitHub
☆227Dec 29, 2022Updated 3 years ago
NATSpeech / NATSpeech
View on GitHub
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and Diff…
☆1,004Apr 2, 2023Updated 3 years ago
guan-yuan / Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion
View on GitHub
A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (…
☆484Sep 28, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
NVIDIA / BigVGAN
View on GitHub
Official PyTorch implementation of BigVGAN (ICLR 2023)
☆1,227Sep 5, 2024Updated last year
svc-develop-team / so-vits-svc
View on GitHub
SoftVC VITS Singing Voice Conversion
☆28,147Nov 11, 2023Updated 2 years ago
jik876 / hifi-gan
View on GitHub
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
☆2,357Jul 27, 2024Updated last year
PlayVoice / whisper-vits-svc
View on GitHub
Core Engine of Singing Voice Conversion & Singing Voice Clone
☆2,863Apr 23, 2024Updated 2 years ago
WelkinYang / Learn2Sing2.0
View on GitHub
Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
☆182Apr 28, 2023Updated 3 years ago
PlayVoice / lora-svc
View on GitHub
singing voice change based on whisper, and lora for singing voice clone
☆648Nov 3, 2023Updated 2 years ago
Rongjiehuang / GenerSpeech
View on GitHub
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
☆333Feb 9, 2024Updated 2 years ago
Rongjiehuang / ProDiff
View on GitHub
PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline
☆432Apr 19, 2023Updated 3 years ago
wenet-e2e / opencpop
View on GitHub
Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis
☆236Dec 10, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
huawei-noah / Speech-Backbones
View on GitHub
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
☆604Sep 18, 2023Updated 2 years ago
kan-bayashi / ParallelWaveGAN
View on GitHub
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
☆1,646Apr 22, 2024Updated 2 years ago
auspicious3000 / contentvec
View on GitHub
speech self-supervised representations
☆520Apr 27, 2023Updated 3 years ago
CODEJIN / HiFiSinger
View on GitHub
☆111Jun 11, 2021Updated 5 years ago
yxlllc / DDSP-SVC
View on GitHub
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
☆2,629Feb 22, 2026Updated 4 months ago
lucidrains / audiolm-pytorch
View on GitHub
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
☆2,621Jan 12, 2025Updated last year
openvpi / vocoders
View on GitHub
DiffSinger community vocoders release page
☆318Feb 27, 2025Updated last year
xunmengshe / OpenUtau
View on GitHub
☆289Sep 9, 2024Updated last year
fishaudio / fish-diffusion
View on GitHub
An easy to understand TTS / SVS / SVC framework
☆748Jun 1, 2026Updated last month
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
yerfor / SyntaSpeech
View on GitHub
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
☆201Sep 4, 2022Updated 3 years ago
microsoft / NeuralSpeech
View on GitHub
☆1,461Feb 11, 2024Updated 2 years ago
lucidrains / naturalspeech2-pytorch
View on GitHub
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
☆1,333Sep 24, 2023Updated 2 years ago
chomeyama / SiFiGAN
View on GitHub
Official implementation of the source-filter HiFiGAN vocoder
☆275Jul 29, 2023Updated 2 years ago
facebookresearch / encodec
View on GitHub
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
☆3,995Jan 4, 2024Updated 2 years ago
openutau / OpenUtau
View on GitHub
Open singing synthesis platform / Open source UTAU successor
☆4,098Jul 7, 2026Updated last week
wenet-e2e / speech-synthesis-paper
View on GitHub
List of speech synthesis papers.
☆1,074Jul 24, 2023Updated 2 years ago