zy-du/Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zy-du/Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion)

zy-du / Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion

This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion".

☆21

Alternatives and similar repositories for Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion

Users that are interested in Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hs-oh-prml / DurFlexEVC
View on GitHub
☆82Jan 22, 2025Updated last year
Chien-Hung / Speech-Emotion-Recognition
View on GitHub
3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.
☆44Nov 13, 2020Updated 5 years ago
gayanechilingar / Change-Emotions
View on GitHub
Nonparallel Emotional Speech Conversion with MUNIT. Introduction: This is a tensorflow implementation of paper(https://arxiv.org/pdf/1811…
☆14Oct 13, 2021Updated 4 years ago
reppy4620 / x-vits
View on GitHub
☆14Aug 1, 2025Updated 11 months ago
ConsistencyVC / ConsistencyVC-voive-conversion
View on GitHub
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
☆154Oct 16, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
vtuber-plan / FlowVAE
View on GitHub
☆17Dec 12, 2023Updated 2 years ago
KunZhou9646 / Emovox
View on GitHub
This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".
☆95Feb 9, 2022Updated 4 years ago
KunZhou9646 / seq2seq-EVC
View on GitHub
This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…
☆87Dec 31, 2022Updated 3 years ago
Wendison / VQMIVC
View on GitHub
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
☆361Apr 27, 2022Updated 4 years ago
3loi / NaturalVoices
View on GitHub
☆61Oct 22, 2025Updated 9 months ago
7Xin / DPI-TTS
View on GitHub
☆13Sep 12, 2024Updated last year
dhchoi99 / NANSY
View on GitHub
☆171Jul 25, 2022Updated 4 years ago
YoungSeng / SRD-VC
View on GitHub
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
☆119Feb 7, 2024Updated 2 years ago
X-E-Speech / X-E-Speech-code
View on GitHub
X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion
☆112Apr 1, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
arnabdas8901 / StarGAN-VC_PlusPlus
View on GitHub
☆11Aug 11, 2023Updated 2 years ago
light1726 / SpeechTripleNet
View on GitHub
The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"
☆33Nov 23, 2023Updated 2 years ago
prairie-schooner / wav2vec-vc
View on GitHub
☆10Mar 22, 2023Updated 3 years ago
gallilmaimon / DISSC
View on GitHub
Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730
☆130Dec 8, 2023Updated 2 years ago
hcy71o / SC-VITS
View on GitHub
VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.
☆36Sep 21, 2022Updated 3 years ago
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
revsic / torch-nansy
View on GitHub
Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513
☆64Feb 13, 2023Updated 3 years ago
ktho22 / vctts
View on GitHub
pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020
☆30Jul 6, 2023Updated 3 years ago
CZ26 / CycleTransGAN-EVC
View on GitHub
CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer
☆35Feb 4, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
archinetai / aligner-pytorch
View on GitHub
Sequence alignement methods with helpers for PyTorch.
☆24Nov 30, 2022Updated 3 years ago
KunZhou9646 / Speaker-independent-emotional-voice-conversion-based-on-conditional-VAW-GAN-and-CWT
View on GitHub
This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…
☆90Nov 13, 2020Updated 5 years ago
iiscleap / ZEST
View on GitHub
Zero-Shot Emotion Style Transfer
☆49Apr 23, 2025Updated last year
tzuhsien / Voice-conversion-evaluation
View on GitHub
An evaluation toolkit for voice conversion models.
☆42Jul 11, 2021Updated 5 years ago
Many0therFunctions / MaskGCT-Text-To-Semantic-Finetune
View on GitHub
This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …
☆13Dec 4, 2024Updated last year
winddori2002 / TriAAN-VC
View on GitHub
TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion
☆146Jan 15, 2024Updated 2 years ago
b04901014 / UUVC
View on GitHub
Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…
☆83Jan 7, 2023Updated 3 years ago
rendchevi / daisy-tts
View on GitHub
🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition
☆14Nov 15, 2025Updated 8 months ago
yxlu-0102 / IDEA-TTS
View on GitHub
Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis
☆27Mar 21, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Ereboas / TacoLM
View on GitHub
☆19May 2, 2024Updated 2 years ago
p1an-lin-jung / WavThruVec_pytorch
View on GitHub
An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"
☆29Sep 6, 2023Updated 2 years ago
zqs01 / data2vecnoisy
View on GitHub
☆11Oct 20, 2022Updated 3 years ago
mushanshanshan / ESLTTS
View on GitHub
ESLTTS dataset
☆16Feb 6, 2025Updated last year
ORI-Muchim / BEGANSing
View on GitHub
BEGANSing - Korean SVS + SVC + AudioSR
☆11Feb 17, 2024Updated 2 years ago
HappyColor / DrawSpeech_PyTorch
View on GitHub
☆25Nov 25, 2025Updated 8 months ago
PlayVoice / VI-SVC
View on GitHub
VI-SVC model is just VITS without MAS and DurationPredictor.
☆10Nov 9, 2023Updated 2 years ago