kanchen-usc / VIG
Dataset for Visually Indicated Sound Generation by Perceptually Optimized Classification
☆21Updated 5 years ago
Alternatives and similar repositories for VIG:
Users that are interested in VIG are comparing it to the libraries listed below
- Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for A…☆23Updated 5 years ago
- 2.5D visual sound dataset☆97Updated 3 years ago
- Keras Implementation of "Look, Listen and Learn" Model☆21Updated 7 years ago
- Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)☆57Updated 5 years ago
- Code for sound synthesis☆50Updated 6 years ago
- Experience-embedded Visual Foresight, CoRL 2019☆14Updated 5 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆121Updated 2 years ago
- Code for making #GANterpretations☆23Updated 4 years ago
- Network specification and demo☆35Updated 7 years ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Updated 6 years ago
- Repo for the work on hierarchical state space models for disentanglement☆21Updated 4 years ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆20Updated 4 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch☆65Updated 6 years ago
- Code for "Training Generative Adversarial Networks with Binary Neurons by End-to-end Backpropagation"☆26Updated 5 years ago
- List of papers about TTS / Список статей о TTS☆10Updated 7 years ago
- Codes for paper <InteL-VAEs: Adding Inductive Biases to VariationalAuto-Encoders via Intermediary Latents>.☆19Updated 3 years ago
- Representation learning for NLP @ JSALT19☆38Updated 4 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- ☆16Updated 3 years ago
- PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021☆23Updated 3 years ago
- Fast-Slow Recurrent Neural Networks☆14Updated 7 years ago
- ☆13Updated 7 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- ☆31Updated 6 years ago
- Evaluation script for VoxMovies dataset in PyTorch☆23Updated last year
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆106Updated 3 years ago
- Audio-conditioned video texture generation☆24Updated 2 years ago
- Audio samples from ICML2019 "Almost Unsupervised Text to Speech and Automatic Speech Recognition"☆17Updated 5 years ago
- Lifelong Variational Autoencoder☆14Updated 7 years ago