liusongxiang/diffsvc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/liusongxiang/diffsvc)

liusongxiang / diffsvc

DiffSVC demo page

☆81

Alternatives and similar repositories for diffsvc

Users that are interested in diffsvc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
Edresson / GE2E-Speaker-Encoder
View on GitHub
GE2E Speaker Encoder - Generalized End-To-End Loss for Speaker Verification
☆14May 17, 2020Updated 6 years ago
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
ktho22 / vctts
View on GitHub
pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020
☆30Jul 6, 2023Updated 3 years ago
MelissaChen15 / control-vc
View on GitHub
This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"
☆132Nov 29, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
thuhcsi / SnakeGAN
View on GitHub
Please visit https://thuhcsi.github.io/SnakeGAN/
☆37Apr 25, 2023Updated 3 years ago
KinglittleQ / pitch-net
View on GitHub
Audio samples of our paper "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network" (accepted by ICASSP2020).
☆11Apr 14, 2020Updated 6 years ago
bshall / hifigan
View on GitHub
An 16kHz implementation of HiFi-GAN for soft-vc.
☆109Jul 19, 2023Updated 3 years ago
exercise-book-yq / FreeCodec
View on GitHub
FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS
☆24Sep 9, 2024Updated last year
shuheikatoinfo / UtterTune
View on GitHub
LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…
☆26Jul 8, 2026Updated 2 weeks ago
yxlllc / RMVPE
View on GitHub
☆79Mar 12, 2026Updated 4 months ago
auspicious3000 / contentvec
View on GitHub
speech self-supervised representations
☆520Apr 27, 2023Updated 3 years ago
leibniz-future-lab / SelfDistill-SER
View on GitHub
☆18Apr 28, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
MingjieChen / LowResourceVC
View on GitHub
Voice conversion training with 109 speakers with limited training samples
☆35Dec 21, 2020Updated 5 years ago
tuanio / nextformer
View on GitHub
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10Dec 15, 2022Updated 3 years ago
kjw11 / Speaker-Aware-CTC
View on GitHub
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆22May 26, 2025Updated last year
ronggong / MIREX-2018-Automatic-Lyrics-to-Audio-Alignment
View on GitHub
Util code, issues, discussions
☆29Aug 31, 2018Updated 7 years ago
lsg1213 / PEAQ_python
View on GitHub
Python version of PEAQ(Perceptual Evaluation of Audio Quality)
☆14Jul 24, 2025Updated last year
suhitaghosh10 / emo-stargan
View on GitHub
Implementation of Emo-StarGAN
☆48Dec 19, 2023Updated 2 years ago
zcf28 / StyleGAN-VC
View on GitHub
Voice Conversion method based on speaker style
☆14Aug 7, 2021Updated 4 years ago
eloimoliner / CQTdiff
View on GitHub
Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23
☆122Mar 14, 2023Updated 3 years ago
YoungJay0612 / Speech-Simulation-Tools
View on GitHub
语音增强领域的相关数据仿真工具和方法汇总--持续更新
☆45Jul 11, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
tommy-fox / streaming-source-separation
View on GitHub
Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.
☆21Dec 8, 2022Updated 3 years ago
isjwdu / DFADD
View on GitHub
Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset
☆16Apr 7, 2025Updated last year
yangdongchao / LLM-Codec
View on GitHub
The open source code for LLM-Codec
☆147Aug 18, 2024Updated last year
winddori2002 / TriAAN-VC
View on GitHub
TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion
☆146Jan 15, 2024Updated 2 years ago
bshall / urhythmic
View on GitHub
Unsupervised Rhythm Modeling for Voice Conversion
☆85Aug 3, 2023Updated 2 years ago
xiaozhuo12138 / PitchNet
View on GitHub
An unofficial implementation of the paper titled "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network".
☆27Apr 17, 2020Updated 6 years ago
xcmyz / ConvTasNet4BasisMelGAN
View on GitHub
This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.
☆21Jul 21, 2021Updated 5 years ago
youcaiSUN / MuSe-Wild_2020
View on GitHub
☆12Aug 24, 2020Updated 5 years ago
wolfgitpr / HubertFA
View on GitHub
Hubert-based Forced Aligner
☆54Mar 19, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
AmeenAli / VideoMatch
View on GitHub
☆14Jan 5, 2022Updated 4 years ago
Kouon-Project / Kouon_Vocoder
View on GitHub
The Kouon_vocoder project is a vocoder project driven by the SVS community and the producers involved in singing synthesis.This project i…
☆18Nov 2, 2024Updated last year
yl4579 / SLMGAN
View on GitHub
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs
☆16Jul 19, 2023Updated 3 years ago
lucadellalib / discrete-wavlm-codec
View on GitHub
A neural speech codec based on discrete WavLM representations
☆26Aug 28, 2024Updated last year
prophesier / diff-svc
View on GitHub
Singing Voice Conversion via diffusion model
☆2,714Jun 6, 2026Updated last month
chaufanglin / Normal2Whisper
View on GitHub
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆14Oct 31, 2024Updated last year
openvpi / DiffSingerMiniEngine
View on GitHub
A minimum inference engine for DiffSinger
☆38Apr 5, 2024Updated 2 years ago