kanchen-usc / VIG
Dataset for Visually Indicated Sound Generation by Perceptually Optimized Classification
☆21Updated 4 years ago
Alternatives and similar repositories for VIG:
Users that are interested in VIG are comparing it to the libraries listed below
- ☆24Updated 4 years ago
- Audio-conditioned video texture generation☆24Updated 2 years ago
- Code for making #GANterpretations☆23Updated 4 years ago
- Code for sound synthesis☆50Updated 6 years ago
- Sequential Learning for Dance generation☆21Updated 4 years ago
- Code to perform shot detection and extraction on video☆11Updated 3 years ago
- Network specification and demo☆35Updated 7 years ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆20Updated 3 years ago
- 2.5D visual sound dataset☆96Updated 3 years ago
- Keras Implementation of "Look, Listen and Learn" Model☆21Updated 7 years ago
- Audio samples from ICML2019 "Almost Unsupervised Text to Speech and Automatic Speech Recognition"☆17Updated 5 years ago
- Script and models for clustering LAION-400m CLIP embeddings.☆25Updated 3 years ago
- Repo for the work on hierarchical state space models for disentanglement☆21Updated 4 years ago
- Codes for paper <InteL-VAEs: Adding Inductive Biases to VariationalAuto-Encoders via Intermediary Latents>.☆19Updated 3 years ago
- ☆24Updated 8 years ago
- Experience-embedded Visual Foresight, CoRL 2019☆14Updated 5 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆121Updated 2 years ago
- Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for A…☆23Updated 5 years ago
- multimodal transformer☆73Updated 3 years ago
- Video examples of "Appearance Composing GAN: A General Method for Appearance-Controllable Human Video Motion Transfer"☆15Updated 4 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- ☆13Updated 7 years ago
- Code for replication of the paper "GANs beyond divergence minimization"☆21Updated 6 years ago
- Implementation of VAE and Style-GAN Architecture Achieving State of the Art Reconstruction☆30Updated 2 years ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Updated 6 years ago
- Implementation of Taming Transformers for High-Resolution Image Synthesis (https://arxiv.org/abs/2012.09841) in PyTorch☆16Updated 4 years ago
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆106Updated 2 years ago
- Official implementation of Generating Object Stamps☆15Updated 4 years ago
- PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021☆23Updated 3 years ago
- Pytorch implementation of Dance Dance Generation: Motion Transfer for Internet Videos☆44Updated 5 years ago