nafiuny / ICRCycleGAN-VCLinks
Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny
☆14Updated last week
Alternatives and similar repositories for ICRCycleGAN-VC
Users that are interested in ICRCycleGAN-VC are comparing it to the libraries listed below
Sorting:
- Implementation of Emo-StarGAN☆45Updated last year
- An architecture that makes any doodle realistic, in any specified style, using VQGAN, CLIP and some basic embedding arithmetics.☆12Updated 4 months ago
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"☆26Updated 2 years ago
- Fast Inference in Denoising Diffusion Models via MMD Finetuning☆18Updated last year
- The official implementation of the paper "Asymmetric Polynomial Loss for Multi-Label Classification"(ICASSP 2023)☆20Updated 2 years ago
- Project page for our paper "DurIAN : DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System".☆10Updated 5 years ago
- Official repository for Polarity Sampling, CVPR 2022 ORAL☆13Updated 3 years ago
- ☆15Updated 6 months ago
- 2019_ML_Course Singing Voice Conversion Using Cycle-GAN:VC2☆16Updated 4 years ago
- ☆17Updated 3 weeks ago
- Demo for 2022 ICASSP☆64Updated 3 years ago
- Face Parsing via SegNeXt, trained on CelebAMask-HQ☆15Updated last year
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆29Updated 2 years ago
- ☆36Updated 5 years ago
- This repository contains the dataset used to train the neural network model descried in the paper "Implicit HRTF Modeling Using Tempora…☆12Updated 2 years ago
- an implementation of FAdam (Fisher Adam) in PyTorch☆49Updated 4 months ago
- Train LoRA using Microsoft's official implementation with Stable Diffusion models.☆32Updated 2 years ago
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS☆40Updated 2 years ago
- This repository contains code for fine-tuning the Whisper speech-to-text model.☆16Updated 2 weeks ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆18Updated 9 months ago
- Official PyTorch repository for Hypercomplex Image-to-Image Transaltion☆19Updated 2 years ago
- The project page repo for Neural Dubber.☆30Updated 2 years ago
- Talking head animation☆28Updated last year
- Tensorflow implementation of pix2pix for creating music from a voice. Vocals2Song.☆17Updated 3 years ago
- speaker-disentangled speech linguistic content quantizer☆22Updated 7 months ago
- Official PyTorch implementation of TTS Style Transfer☆25Updated 3 years ago
- The official code of WaveGAN: Frequency-aware GAN for High-Fidelity Few-shot Image Generation (ECCV2022)☆78Updated 2 years ago
- This is a collection of resources on AI-AR-ART generation.☆29Updated 2 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆15Updated last year
- [ECCV2022] Mind the Gap in Distilling StyleGANs☆29Updated 2 years ago