dunbar12138 / Audiovisual-Synthesis
Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders
☆120Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Audiovisual-Synthesis
- You Said That?: Synthesising Talking Faces from Audio☆69Updated 6 years ago
- 2.5D visual sound dataset☆92Updated 3 years ago
- Speech-conditioned face generation using Generative Adversarial Networks☆87Updated last year
- Implementation of NWT, audio-to-video generation, in Pytorch☆87Updated 2 years ago
- 2.5D visual sound☆110Updated last year
- Speech-conditioned face generation using Generative Adversarial Networks (ICASSP 2019)☆56Updated 2 years ago
- Talking Face Generation by Conditional Recurrent Adversarial Network☆61Updated 4 years ago
- ☆35Updated 6 years ago
- Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)☆56Updated 5 years ago
- Python implementation of the paper " Dynamic Temporal Alignment of Speech to Lips"☆30Updated 5 years ago
- AVSpeech downloader☆66Updated 5 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆68Updated 5 years ago
- Information Distillation Generative Adversrial Network in PyTorch☆27Updated 4 years ago
- Toward Spatially Unbiased Generative Models (ICCV 2021)☆90Updated 3 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆154Updated 4 years ago
- AlignNet: A Unifying Approach to Audio-Visual Alignment (WACV 2020)☆31Updated 3 years ago
- LPC Utility for Pytorch Library.☆43Updated 3 months ago
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)☆62Updated 3 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆79Updated 3 years ago
- Demo audio of VARA-TTS model☆20Updated 3 years ago
- A pytorch implementation of StarGAN-VC2☆146Updated 4 years ago
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆104Updated 2 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆103Updated 5 months ago
- Pytorch implementation of Dance Dance Generation: Motion Transfer for Internet Videos☆43Updated 5 years ago
- Quasi-Periodic Parallel WaveGAN Pytorch implementation☆46Updated 2 years ago
- ☆28Updated 4 years ago
- ☆48Updated last year
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆170Updated 3 months ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆87Updated 4 years ago