xianghenghe / Improved_StarGAN_Emotional_Voice_Conversion

The official PyTorch implementation of the paper "An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation" (Interspeech 2021)

☆9

Alternatives and similar repositories for Improved_StarGAN_Emotional_Voice_Conversion

Users who are interested in Improved_StarGAN_Emotional_Voice_Conversion are comparing it to the repositories listed below.

sarulab-speech / multi-speaker-dgp
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24 Updated 4 years ago
thuhcsi / SnakeGAN
Please visit https://thuhcsi.github.io/SnakeGAN/
☆37 Updated 2 years ago
hrnoh / f0-autovc
Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"
☆29 Updated 4 years ago
Kahsolt / TransTacoS-RetuneGAN
A toy Text-to-Speech system for Chinese/Mandarin synthesis, inspired by Tacotron, FastSpeech2, and RefineGAN.
☆15 Updated 3 years ago
anton-kashkin / hifi_vc
☆25 Updated 2 years ago
asuni / PitchSqueezer
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆34 Updated last year
walker-hyf / FCTalker
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)
☆25 Updated last year
yuan1615 / AdaVocoder
Adaptive Vocoder for Custom Voice
☆60 Updated 2 years ago
gteu / realtime-ppg-vc
Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.
☆28 Updated 3 years ago
NVIDIA / elucidated-text-to-audio
Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…
☆53 Updated last month
Labmem-Zhouyx / CDFSE_FastSpeech2
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…
☆86 Updated 2 years ago
shang0712 / HierTTS
☆45 Updated 2 years ago
scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…
☆61 Updated last year
light1726 / SpeechTripleNet
The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"
☆34 Updated last year
xinshengwang / ICASSP2021_paper_list-VC
ICASSP 2021 accepted papers on voice conversion (VC)
☆18 Updated 4 years ago
light1726 / BetaVAE_VC
Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using β-VAE"
☆42 Updated 2 years ago
WangHelin1997 / DuTa-VC
Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…
☆37 Updated last year
lmxue / ICASSP2022_TTS_VC_Summary
ICASSP2022 TTS&VC Summary
☆14 Updated 3 years ago
CODEJIN / XiaoiceSing2
☆19 Updated 2 years ago
MWM-io / nansypp
Unofficial implementation of NANSY++ in Pytorch Lightning
☆50 Updated last year
thuhcsi / icassp2021-emotion-tts
Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/
☆34 Updated 2 years ago
ryota-komatsu / speech_resynth
Speech Resynthesis and Language Modeling
☆25 Updated 2 months ago
xcmyz / ConvTasNet4BasisMelGAN
This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.
☆21 Updated 4 years ago
Dapwner / CVAE-Tacotron
☆24 Updated last year
Joshua-1995 / LearnableUpsamplingLayer-Pytorch
Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)
☆55 Updated last year
p0p4k / Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
☆70 Updated last year
chomeyama / HN-UnifiedSourceFilterGAN
☆87 Updated 2 years ago
rhoposit / icassp2021
☆15 Updated 4 years ago
chomeyama / UnifiedSourceFilterGAN
☆19 Updated 3 years ago
hcy71o / SC-CNN
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
☆39 Updated last year