nafiuny / ICRCycleGAN-VC
Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny
☆14Updated 10 months ago
Alternatives and similar repositories for ICRCycleGAN-VC:
Users that are interested in ICRCycleGAN-VC are comparing it to the libraries listed below
- Implementation of Emo-StarGAN☆45Updated last year
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"☆23Updated last year
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆16Updated 3 months ago
- ☆34Updated 4 years ago
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS☆38Updated last year
- an implementation of FAdam (Fisher Adam) in PyTorch☆43Updated 11 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆28Updated last year
- Zero-Shot Emotion Style Transfer☆45Updated 2 weeks ago
- An official implementation of Style-Talker for Spoken Dialogue Generation☆17Updated 4 months ago
- Visual Speech Recongnition☆16Updated 4 months ago
- Fast Inference in Denoising Diffusion Models via MMD Finetuning☆17Updated last year
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 3 months ago
- ☆24Updated 3 months ago
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"☆27Updated 2 years ago
- Official repository for Polarity Sampling, CVPR 2022 ORAL☆13Updated 2 years ago
- ☆25Updated 9 months ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Updated last year
- CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer☆34Updated 3 months ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Updated last year
- ☆29Updated last year
- ☆55Updated last year
- Official implementation of OSSGAN [CVPR 2022]☆21Updated 3 years ago
- Project page for our paper "DurIAN : DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System".☆10Updated 4 years ago
- ☆14Updated 4 years ago
- 2019_ML_Course Singing Voice Conversion Using Cycle-GAN:VC2☆16Updated 4 years ago
- A neural speech codec based on discrete WavLM representations☆24Updated 8 months ago
- GAN series for voice conversion on VCC2018 dataset☆16Updated 4 years ago
- A collection of all our phonemeizers for dataset construction and inference☆22Updated 2 months ago
- Supervoice Speaker Separation Network☆12Updated 11 months ago