kanchen-usc / VIG
Dataset for Visually Indicated Sound Generation by Perceptually Optimized Classification
☆21Updated 4 years ago
Alternatives and similar repositories for VIG:
Users that are interested in VIG are comparing it to the libraries listed below
- Code for making #GANterpretations☆23Updated 4 years ago
- 2.5D visual sound dataset☆94Updated 3 years ago
- Audio-conditioned video texture generation☆24Updated 2 years ago
- Code for sound synthesis☆50Updated 6 years ago
- Video examples of "Appearance Composing GAN: A General Method for Appearance-Controllable Human Video Motion Transfer"☆15Updated 4 years ago
- Implementation of VAE and Style-GAN Architecture Achieving State of the Art Reconstruction☆30Updated last year
- Source code for "Towards a Deeper Understanding of Adversarial Losses under a Discriminative Adversarial Network Setting"☆42Updated 2 years ago
- List of papers about TTS / Список статей о TTS☆10Updated 7 years ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Updated 6 years ago
- MetaPix: Few-Shot Video Retargeting☆48Updated 5 years ago
- Repo for the work on hierarchical state space models for disentanglement☆21Updated 3 years ago
- Representations of language in a model of visually grounded speech signal.☆23Updated 6 years ago
- Keras Implementation of "Look, Listen and Learn" Model☆21Updated 7 years ago
- Code for replication of the paper "GANs beyond divergence minimization"☆21Updated 6 years ago
- Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for A…☆23Updated 5 years ago
- Author's implementation of "Visual Element Discovery as Discriminative Mode Seeking," Doersch, Gupta & Efros, NIPS 2013☆20Updated 10 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆120Updated 2 years ago
- Codes for paper <InteL-VAEs: Adding Inductive Biases to VariationalAuto-Encoders via Intermediary Latents>.☆19Updated 3 years ago
- Controllable Face Generation via pretrained Conditional Adversarial Latent Autoencoder (ALAE)☆19Updated 4 years ago
- Code to perform shot detection and extraction on video☆11Updated 3 years ago
- A CLIP conditioned Decision Transformer.☆22Updated 3 years ago
- Contrastive Language-Audio Pretraining☆15Updated 3 years ago
- Datasets for new state-of-the-art challenge in disentanglement learning☆45Updated 5 years ago
- Experience-embedded Visual Foresight, CoRL 2019☆14Updated 5 years ago
- PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021☆23Updated 3 years ago
- Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)☆56Updated 5 years ago
- Pytorch implementation of Dance Dance Generation: Motion Transfer for Internet Videos☆44Updated 5 years ago
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)☆63Updated 3 years ago
- FactorGAN - Training GANs with missing data☆36Updated 6 months ago