nafiuny / ICRCycleGAN-VCLinks
Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny
☆15Updated 2 months ago
Alternatives and similar repositories for ICRCycleGAN-VC
Users that are interested in ICRCycleGAN-VC are comparing it to the libraries listed below
Sorting:
- Implementation of Emo-StarGAN☆45Updated 2 years ago
- Tensorflow implementation of pix2pix for creating music from a voice. Vocals2Song.☆17Updated 3 years ago
- Project page for our paper "DurIAN : DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System".☆10Updated 5 years ago
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"☆28Updated 2 years ago
- Fast Inference in Denoising Diffusion Models via MMD Finetuning☆18Updated 2 years ago
- 2019_ML_Course Singing Voice Conversion Using Cycle-GAN:VC2☆16Updated 5 years ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆19Updated 11 months ago
- ☆37Updated 5 years ago
- This repository contains the dataset used to train the neural network model descried in the paper "Implicit HRTF Modeling Using Tempora…☆12Updated 2 years ago
- Finally, some decent sample sentences☆23Updated 2 years ago
- An architecture that makes any doodle realistic, in any specified style, using VQGAN, CLIP and some basic embedding arithmetics.☆12Updated 6 months ago
- speaker-disentangled speech linguistic content quantizer☆24Updated 9 months ago
- Enhancment of Audio Quality (Bit-Depth and Sampling-Rate) using Deep Learning.☆33Updated 5 years ago
- ☆23Updated 2 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631☆23Updated 3 years ago
- Demo for 2022 ICASSP☆64Updated 3 years ago
- an implementation of FAdam (Fisher Adam) in PyTorch☆49Updated 6 months ago
- This repository contains code for fine-tuning the Whisper speech-to-text model.☆20Updated 3 weeks ago
- Official implementation of OSSGAN [CVPR 2022]☆21Updated 3 years ago
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆29Updated 2 years ago
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS☆40Updated 2 years ago
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 10 months ago
- ☆14Updated 4 years ago
- ☆41Updated 5 months ago
- ☆28Updated 11 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- This is a collection of resources on AI-AR-ART generation.☆28Updated 3 years ago
- Face Parsing via SegNeXt, trained on CelebAMask-HQ☆15Updated 2 years ago
- ☆22Updated last year
- ☆23Updated 4 years ago