dunbar12138/Audiovisual-Synthesis

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dunbar12138/Audiovisual-Synthesis)

dunbar12138 / Audiovisual-Synthesis

Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders

☆123

Alternatives and similar repositories for Audiovisual-Synthesis

Users that are interested in Audiovisual-Synthesis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NVlabs / wc-vid2vid
View on GitHub
☆19Feb 1, 2021Updated 5 years ago
ICLR-DAP / Deep-Audio-Prior
View on GitHub
Anonymous ICLR Submission
☆14Sep 25, 2019Updated 6 years ago
ljuvela / multiscale-GAN
View on GitHub
Code for ICASSP 2019 paper
☆18Oct 29, 2018Updated 7 years ago
ANLGBOY / WaveNODE
View on GitHub
Pytorch Implementation of WaveNODE
☆64Sep 4, 2020Updated 5 years ago
cpuimage / Tacotron-2
View on GitHub
Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)
☆11Jul 12, 2019Updated 7 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
joansj / blow
View on GitHub
Code to train and run Blow
☆145Sep 4, 2019Updated 6 years ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
jayneelparekh / sp2si-code
View on GitHub
Contains code for our work on speech to singing conversion (ICASSP 2020)
☆50Oct 27, 2020Updated 5 years ago
ttaoREtw / semi-tts
View on GitHub
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
☆39Jul 16, 2020Updated 6 years ago
ivanvovk / durian-pytorch
View on GitHub
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
☆184Aug 12, 2020Updated 5 years ago
geneing / WaveRNN-Pytorch
View on GitHub
Fatcord's Alternative WaveRNN (Faster training)
☆132Nov 29, 2020Updated 5 years ago
omarperacha / GANkyoku
View on GitHub
A Generative Adversarial Network for Shakuhachi Music
☆14Jul 2, 2019Updated 7 years ago
NVIDIA / mellotron
View on GitHub
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…
☆870Jul 22, 2023Updated 2 years ago
edufonseca / uclser20
View on GitHub
Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.
☆93Dec 22, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
r9y9 / sinsy
View on GitHub
A fork of sinsy: HMM/DNN-based singing voice synthesis system
☆74Feb 6, 2022Updated 4 years ago
ncsoft / avocodo
View on GitHub
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
☆154Feb 1, 2023Updated 3 years ago
janvainer / speedyspeech
View on GitHub
☆262Dec 8, 2022Updated 3 years ago
bajibabu / postfilt_gan
View on GitHub
This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"
☆16Jun 27, 2018Updated 8 years ago
KimythAnly / AGAIN-VC
View on GitHub
This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…
☆114Dec 7, 2020Updated 5 years ago
jaywalnut310 / waveglow-vqvae
View on GitHub
WaveGlow vocoder with VQVAE
☆61Jun 18, 2019Updated 7 years ago
yanggeng1995 / FB-MelGAN
View on GitHub
A pytroch implementation of the FB-MelGAN
☆90May 26, 2020Updated 6 years ago
seungwonpark / melgan
View on GitHub
MelGAN vocoder (compatible with NVIDIA/tacotron2)
☆650Oct 3, 2020Updated 5 years ago
tarepan / VoiceConversionLab
View on GitHub
Collect Voice Conversion researches
☆97Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
paarthneekhara / advoc
View on GitHub
Vocode spectrograms to audio with generative adversarial networks
☆64Aug 8, 2019Updated 6 years ago
pc2752 / Multi_Voice_Sing_Speak_Sing
View on GitHub
☆24Mar 24, 2023Updated 3 years ago
patrickltobing / shallow-wavenet
View on GitHub
☆18Feb 9, 2020Updated 6 years ago
hyperconnect / MarioNETte
View on GitHub
MarioNETte: Few-shot Face Reenactment Preserving Identity of Unseen Targets
☆39Nov 21, 2019Updated 6 years ago
bfs18 / nsynth_wavenet
View on GitHub
parallel wavenet based on nsynth
☆106Dec 14, 2018Updated 7 years ago
keonlee9420 / STYLER
View on GitHub
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…
☆159Jun 5, 2025Updated last year
rishikksh20 / vae_tacotron2
View on GitHub
VAE Tacotron 2, an alternative of GST Tacotron
☆91Jul 6, 2023Updated 3 years ago
Hangz-nju-cuhk / Talking-Face-Generation-DAVS
View on GitHub
Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)
☆813May 11, 2021Updated 5 years ago
voidful / vall-e-encodec
View on GitHub
☆41May 15, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
MingjieChen / LowResourceVC
View on GitHub
Voice conversion training with 109 speakers with limited training samples
☆35Dec 21, 2020Updated 5 years ago
ariacat3366 / ACVAE-VC
View on GitHub
☆22Jan 15, 2019Updated 7 years ago
auspicious3000 / SpeechSplit
View on GitHub
Unsupervised Speech Decomposition Via Triple Information Bottleneck
☆697Oct 23, 2024Updated last year
Bartzi / one-model-to-reconstruct-them-all
View on GitHub
Code for our Paper "One Model to Reconstruct Them All: A Novel Way to Use the Stochastic Noise in StyleGAN"
☆73Nov 17, 2020Updated 5 years ago
unilight / cdvae-vc
View on GitHub
TensorFlow Implementation of CDVAE-VC.
☆54Mar 24, 2023Updated 3 years ago
azraelkuan / tensorflow_wavenet_vocoder
View on GitHub
wavenet vocoder using tensorflow
☆26Feb 18, 2018Updated 8 years ago
auspicious3000 / autovc
View on GitHub
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
☆1,099Oct 23, 2024Updated last year