kanchen-usc / VIG
Dataset for Visually Indicated Sound Generation by Perceptually Optimized Classification
☆21Updated 5 years ago
Alternatives and similar repositories for VIG
Users that are interested in VIG are comparing it to the libraries listed below
Sorting:
- 2.5D visual sound dataset☆98Updated 3 years ago
- Code for making #GANterpretations☆23Updated 4 years ago
- Video examples of "Appearance Composing GAN: A General Method for Appearance-Controllable Human Video Motion Transfer"☆15Updated 4 years ago
- Implementation of Taming Transformers for High-Resolution Image Synthesis (https://arxiv.org/abs/2012.09841) in PyTorch☆16Updated 4 years ago
- Code for sound synthesis☆50Updated 6 years ago
- Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)☆57Updated 5 years ago
- Audio-conditioned video texture generation☆24Updated 2 years ago
- Code for replication of the paper "GANs beyond divergence minimization"☆21Updated 6 years ago
- Repo for the work on hierarchical state space models for disentanglement☆21Updated 4 years ago
- Generate vector embeddings for music☆18Updated 7 years ago
- Network specification and demo☆35Updated 7 years ago
- Code to perform shot detection and extraction on video☆11Updated 3 years ago
- Experience-embedded Visual Foresight, CoRL 2019☆14Updated 5 years ago
- ☆19Updated 4 years ago
- IterGANs: Iterative GANs for rotating visual objects☆13Updated 6 years ago
- Convert @NVlabs StyleGAN pkls to @taki0112 StyleGAN-Tensorflow checkpoints (copy over the weights)☆27Updated 5 years ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Updated 6 years ago
- Code for reproducing experiments in "Exploiting GAN Internal Capacity for High-Quality Reconstruction of Natural Images"☆16Updated 5 years ago
- ☆21Updated 4 years ago
- A data-driven approach for interactively synthesizing diverse images from semantic label maps.☆39Updated 5 years ago
- Audio samples from ICML2019 "Almost Unsupervised Text to Speech and Automatic Speech Recognition"☆17Updated 5 years ago
- Pytorch implementation of Dance Dance Generation: Motion Transfer for Internet Videos☆44Updated 5 years ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆20Updated 4 years ago
- Keras Implementation of "Look, Listen and Learn" Model☆21Updated 7 years ago
- AlignNet: A Unifying Approach to Audio-Visual Alignment (WACV 2020)☆33Updated 4 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- Whitening and Coloring transform for GANs☆35Updated 5 years ago
- MetaPix: Few-Shot Video Retargeting☆48Updated 5 years ago
- Datasets for new state-of-the-art challenge in disentanglement learning☆45Updated 5 years ago
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆106Updated 3 years ago