Ldoun/DeepSinger

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Ldoun/DeepSinger)

Ldoun / DeepSinger

☆36

Alternatives and similar repositories for DeepSinger

Users that are interested in DeepSinger are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

leavelet / singing-database-maker
View on GitHub
AI based singing voice synthesis database generator
☆13Aug 12, 2022Updated 3 years ago
keonlee9420 / DiffSinger
View on GitHub
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
☆248Feb 3, 2022Updated 4 years ago
SJTMusicTeam / MusicGeneration
View on GitHub
☆10May 15, 2021Updated 5 years ago
PlayVoice / VI-SVS
View on GitHub
Singing Voice Synthesis based on VITS, different from VISinger
☆198Nov 13, 2023Updated 2 years ago
vTAD2025-Challenge / vTAD
View on GitHub
☆16Oct 24, 2025Updated 8 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
SJTMusicTeam / Muskits
View on GitHub
An opensource music processing toolkit
☆320Jun 25, 2023Updated 3 years ago
neosapience / mlp-singer
View on GitHub
Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis (IEEE MLSP 2021)
☆118Feb 24, 2022Updated 4 years ago
ORI-Muchim / Efficient-Speech
View on GitHub
Lightweight Korean TTS Model based on FastSpeech2
☆15Mar 4, 2026Updated 4 months ago
aaronng91 / semantic-turn-detection
View on GitHub
Script to demonstrate how to use a Language Model for Semantic Turn Detection. Refer to blog post for full details.
☆18May 9, 2025Updated last year
Rongjiehuang / Multi-Singer
View on GitHub
PyTorch Implementation of Multi-Singer (ACM-MM'21)
☆139May 8, 2022Updated 4 years ago
MuyangDu / T5Voice
View on GitHub
T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …
☆28Nov 7, 2025Updated 8 months ago
juice500ml / xlm_to_xlsr
View on GitHub
Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)
☆12Mar 12, 2024Updated 2 years ago
xushengyuan / FastSing2
View on GitHub
An imporved version of Fastsinging singing voice synthesising system.
☆21Nov 3, 2020Updated 5 years ago
keunwoochoi / music4all_contrib
View on GitHub
☆32Dec 29, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
genisplaja / diffusion-vocal-sep
View on GitHub
Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)
☆17Feb 16, 2023Updated 3 years ago
WelkinYang / Learn2Sing2.0
View on GitHub
Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
☆182Apr 28, 2023Updated 3 years ago
chenjianyi / fastsag
View on GitHub
FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation
☆29Dec 19, 2024Updated last year
MTG / singing-synthesis-demos
View on GitHub
Sound examples for the Neural Parametric Singing Synthesizer (NPSS)
☆23Feb 24, 2022Updated 4 years ago
Respaired / RiFornet_Vocoder
View on GitHub
a Neural Vocoder supporting Ring Attention, Conformer and NSF.
☆25Aug 1, 2025Updated 11 months ago
ictnlp / StreamUni
View on GitHub
StreamUni is a framework that efficiently enables unified Large Speech-Language Models to accomplish streaming speech translation in a co…
☆22Jul 14, 2025Updated last year
samsad35 / code-ancogen
View on GitHub
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14Mar 11, 2025Updated last year
pengzhendong / ngram-punctuator
View on GitHub
An N-gram punctuator for Chinese and English.
☆18Oct 14, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
YatingMusic / ddsp-singing-vocoders
View on GitHub
Official implementation of SawSing (ISMIR'22)
☆275Aug 28, 2022Updated 3 years ago
ictnlp / LSG
View on GitHub
The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”
☆15Jan 3, 2025Updated last year
BUTSpeechFIT / SOT-DiCoW
View on GitHub
Multi-talker ASR based on DiCoW with Serialized Output Training
☆20Sep 18, 2025Updated 10 months ago
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
ethanhe42 / dds
View on GitHub
DDS: Delta Denoising Score PyTorch implementation
☆19Sep 2, 2023Updated 2 years ago
igormq / speech2text
View on GitHub
☆12Feb 9, 2021Updated 5 years ago
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
colorful-liyu / Awesome-AI-ART-generation
View on GitHub
This is a collection of resources on AI-AR-ART generation.
☆28Dec 14, 2022Updated 3 years ago
noafish / MorphGAN
View on GitHub
☆15Jun 15, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
CODEJIN / HiFiSinger
View on GitHub
☆111Jun 11, 2021Updated 5 years ago
ZQuang2202 / Zipformer_Lightning
View on GitHub
An upgrade framework for train and validate compare with icefall using Lightning.
☆16Mar 26, 2025Updated last year
jayneelparekh / sp2si-code
View on GitHub
Contains code for our work on speech to singing conversion (ICASSP 2020)
☆50Oct 27, 2020Updated 5 years ago
gburlet / robotaba
View on GitHub
Automatic Guitar Tablature Transcription Online
☆19Nov 6, 2013Updated 12 years ago
ExplainableML / ZerAuCap
View on GitHub
[NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords
☆19Nov 30, 2024Updated last year
ArenAcikgoz / Whisper-Alignment
View on GitHub
Forced alignment decoder for Whisper.
☆16Mar 13, 2024Updated 2 years ago
gwx314 / STARS
View on GitHub
STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation
☆85Nov 11, 2025Updated 8 months ago