KunZhou9646/Emovox

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KunZhou9646/Emovox)

KunZhou9646 / Emovox

This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".

☆95

Alternatives and similar repositories for Emovox

Users that are interested in Emovox are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CZ26 / CycleTransGAN-EVC
View on GitHub
CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer
☆35Feb 4, 2025Updated last year
KunZhou9646 / seq2seq-EVC
View on GitHub
This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…
☆87Dec 31, 2022Updated 3 years ago
KunZhou9646 / Mixed_Emotions
View on GitHub
☆123Oct 24, 2022Updated 3 years ago
KunZhou9646 / Speaker-independent-emotional-voice-conversion-based-on-conditional-VAW-GAN-and-CWT
View on GitHub
This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…
☆90Nov 13, 2020Updated 5 years ago
glam-imperial / EmotionalConversionStarGAN
View on GitHub
This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data…
☆136Oct 24, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
KunZhou9646 / emotional-voice-conversion-with-CycleGAN-and-CWT-for-Spectrum-and-F0
View on GitHub
This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-…
☆124Dec 14, 2020Updated 5 years ago
gayanechilingar / Change-Emotions
View on GitHub
Nonparallel Emotional Speech Conversion with MUNIT. Introduction: This is a tensorflow implementation of paper(https://arxiv.org/pdf/1811…
☆14Oct 13, 2021Updated 4 years ago
ktho22 / vctts
View on GitHub
pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020
☆30Jul 6, 2023Updated 3 years ago
ttslr / StrengthNet
View on GitHub
[INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
☆83Nov 4, 2022Updated 3 years ago
KunZhou9646 / controllable_evc_code
View on GitHub
This is the code for controllable EVC framework for seen and unseen emotion generation.
☆45Nov 3, 2021Updated 4 years ago
jxzhanggg / nonparaSeq2seqVC_code
View on GitHub
Implementation code of non-parallel sequence-to-sequence VC
☆248Mar 24, 2023Updated 3 years ago
zy-du / Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion
View on GitHub
This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…
☆21Sep 18, 2023Updated 2 years ago
bottlecapper / EmoCycleGAN
View on GitHub
Emotional Speech Conversion using Nonparallel Data
☆17Apr 10, 2019Updated 7 years ago
chaitanya100100 / Relative-Attributes-Zero-Shot-Learning
View on GitHub
Python Implementation of Visual Relative Attributes for Image Classification and Zero Shot Learning
☆22Jun 14, 2018Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HLTSingapore / Emotional-Speech-Data
View on GitHub
This is the GitHub page for publicly available emotional speech data.
☆403Jan 6, 2022Updated 4 years ago
hs-oh-prml / DiffProsody
View on GitHub
☆69Jul 29, 2023Updated 3 years ago
walker-hyf / FCTalker
View on GitHub
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)
☆26Feb 22, 2024Updated 2 years ago
keonlee9420 / Expressive-FastSpeech2
View on GitHub
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…
☆322Aug 25, 2021Updated 4 years ago
Wendison / VQMIVC
View on GitHub
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
☆361Apr 27, 2022Updated 4 years ago
hhguo / EA-SVC
View on GitHub
An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"
☆125Nov 4, 2020Updated 5 years ago
X-E-Speech / X-E-Speech-code
View on GitHub
X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion
☆112Apr 1, 2024Updated 2 years ago
gteu / realtime-ppg-vc
View on GitHub
Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.
☆29Mar 3, 2022Updated 4 years ago
YoungSeng / SRD-VC
View on GitHub
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
☆119Feb 7, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
b04901014 / UUVC
View on GitHub
Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…
☆83Jan 7, 2023Updated 3 years ago
Choddeok / EmoSphere-TTS
View on GitHub
[INTERSPEECH 2024] The official implementation of EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for …
☆182Jul 16, 2026Updated last week
b04901014 / FG-transformer-TTS
View on GitHub
Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.
☆90Mar 5, 2022Updated 4 years ago
hrnoh / f0-autovc
View on GitHub
Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"
☆29Nov 6, 2020Updated 5 years ago
Choddeok / EmoSpherepp
View on GitHub
[TAFFC 2025] The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vec…
☆129Jul 16, 2026Updated last week
Aria-K-Alethia / laughter-synthesis
View on GitHub
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆77Jul 16, 2023Updated 3 years ago
howard1337 / S2VC
View on GitHub
☆100Jul 22, 2021Updated 5 years ago
keonlee9420 / Cross-Speaker-Emotion-Transfer
View on GitHub
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…
☆194Nov 9, 2022Updated 3 years ago
v-nhandt21 / MusicVoiceConversion
View on GitHub
Sing any popular song with your voice
☆11Jul 10, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ConsistencyVC / ConsistencyVC-voive-conversion
View on GitHub
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
☆154Oct 16, 2023Updated 2 years ago
cjerry1243 / TransferLearning-CLVC
View on GitHub
Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion
☆40Oct 22, 2022Updated 3 years ago
cyhuang-tw / AdaIN-VC
View on GitHub
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…
☆119May 27, 2021Updated 5 years ago
hcy71o / AutoVocoder
View on GitHub
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
☆71Dec 2, 2022Updated 3 years ago
carl-robinson / voice-emotion-seq2seq
View on GitHub
Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in Tensorflow.
☆27Oct 30, 2018Updated 7 years ago
keonlee9420 / DailyTalk
View on GitHub
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
☆260Jun 5, 2025Updated last year
hs-oh-prml / DurFlexEVC
View on GitHub
☆82Jan 22, 2025Updated last year