trinhtuanvubk/Diff-VC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/trinhtuanvubk/Diff-VC)

trinhtuanvubk / Diff-VC

Diffusion Model for Voice Conversion

☆72

Alternatives and similar repositories for Diff-VC

Users that are interested in Diff-VC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WangHelin1997 / DuTa-VC
View on GitHub
Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…
☆38Dec 5, 2023Updated 2 years ago
hayeong0 / Diff-HierVC
View on GitHub
Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…
☆237Jul 3, 2024Updated 2 years ago
KunZhou9646 / Emovox
View on GitHub
This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".
☆95Feb 9, 2022Updated 4 years ago
trinhtuanvubk / KWS-BCResnet
View on GitHub
Keyword Spotting using BCResNet and Arcface Loss
☆13Jan 28, 2022Updated 4 years ago
KunZhou9646 / seq2seq-EVC
View on GitHub
This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…
☆87Dec 31, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
trinhtuanvubk / handwritten-ocr
View on GitHub
My personal implementation of SVTR model for handwritten OCR
☆14Mar 1, 2024Updated 2 years ago
kaistmm / AdaptVC
View on GitHub
☆17Jun 2, 2025Updated last year
JeffC0628 / awesome-voice-conversion
View on GitHub
A curated list of awesome voice conversion, projects and communities.
☆267Nov 18, 2025Updated 8 months ago
huawei-noah / Speech-Backbones
View on GitHub
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
☆604Sep 18, 2023Updated 2 years ago
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
xiuwenz2 / SAPC-template
View on GitHub
☆17Mar 20, 2026Updated 4 months ago
cyhuang-tw / AdaIN-VC
View on GitHub
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…
☆119May 27, 2021Updated 5 years ago
bshall / urhythmic
View on GitHub
Unsupervised Rhythm Modeling for Voice Conversion
☆85Aug 3, 2023Updated 2 years ago
ktho22 / vctts
View on GitHub
pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020
☆30Jul 6, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
kamepong / StarGAN-VC
View on GitHub
☆23Dec 14, 2021Updated 4 years ago
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
WingZLeung / TTDS
View on GitHub
Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.
☆13Mar 15, 2025Updated last year
YoungSeng / SRD-VC
View on GitHub
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
☆119Feb 7, 2024Updated 2 years ago
unilight / s3prl-vc
View on GitHub
S3PRL-VC: A Voice Conversion Toolkit based on S3PRL
☆101Mar 15, 2026Updated 4 months ago
ConsistencyVC / ConsistencyVC-voive-conversion
View on GitHub
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
☆154Oct 16, 2023Updated 2 years ago
unilight / seq2seq-vc
View on GitHub
A sequence-to-sequence voice conversion toolkit.
☆113Mar 15, 2026Updated 4 months ago
yl4579 / StyleTTS-VC
View on GitHub
Official Implementation of StyleTTS-VC
☆200Jan 14, 2025Updated last year
AgentCooper2002 / EDMSound
View on GitHub
Codebase and project page for EDMSound
☆35Nov 20, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lsfhuihuiff / MusicTI_AAAI2024
View on GitHub
" Music Style Transfer with Time-Varying Inversion of Diffusion Models"
☆59Jul 23, 2024Updated 2 years ago
yangdongchao / LLM-Codec
View on GitHub
The open source code for LLM-Codec
☆147Aug 18, 2024Updated last year
Wendison / VQMIVC
View on GitHub
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
☆361Apr 27, 2022Updated 4 years ago
X-LANCE / VoiceFlow-TTS
View on GitHub
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
☆376Sep 3, 2024Updated last year
tinkoff-ai / hifi_vc
View on GitHub
☆40Jan 24, 2023Updated 3 years ago
hs-oh-prml / DurFlexEVC
View on GitHub
☆82Jan 22, 2025Updated last year
zy-du / Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion
View on GitHub
This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…
☆21Sep 18, 2023Updated 2 years ago
biggytruck / SpeechSplit2
View on GitHub
Official implementation of SpeechSplit2
☆135Oct 22, 2022Updated 3 years ago
suhitaghosh10 / emo-stargan
View on GitHub
Implementation of Emo-StarGAN
☆48Dec 19, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
freds0 / free-svc
View on GitHub
[ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion
☆95Jul 23, 2025Updated last year
PecholaL / MAIN-VC
View on GitHub
Lightweight Speech Representation Learning for One-Shot Voice Conversion
☆23Dec 12, 2024Updated last year
liusongxiang / StarGAN-Voice-Conversion
View on GitHub
This is a pytorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial ne…
☆523Oct 11, 2019Updated 6 years ago
MingjieChen / EasyVC
View on GitHub
A toolkit for any-to-any encoder-decoder voice conversion systems
☆83Aug 10, 2023Updated 2 years ago
RF5 / simple-asgan
View on GitHub
Training code and trained checkpoints for ASGAN.
☆62Dec 27, 2023Updated 2 years ago
KoMyeongJin / SpecDiff-GAN
View on GitHub
Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS
☆40Aug 4, 2023Updated 2 years ago
jagabandhumishra / W2V-E2E-Language-Diarization
View on GitHub
☆11Sep 4, 2023Updated 2 years ago