hangzhaomit / Sound-of-Pixels
Codebase for ECCV18 "The Sound of Pixels"
☆371Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Sound-of-Pixels
- MUSIC Dataset from The Sound of Pixels (ECCV '18)☆119Updated 2 years ago
- 2.5D visual sound☆110Updated last year
- VGGSound: A Large-scale Audio-Visual Dataset☆292Updated 3 years ago
- Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features☆220Updated 5 years ago
- ☆223Updated 4 years ago
- 2.5D visual sound dataset☆92Updated 3 years ago
- Co-Separating Sounds of Visual Objects (ICCV 2019)☆94Updated last year
- Pytorch port of Google Research's VGGish model used for extracting audio features.☆377Updated 3 years ago
- A curated list of different papers and datasets in various areas of audio-visual processing☆671Updated 9 months ago
- A UNIVERSAL MUSIC TRANSLATION NETWORK - a method for translating music across musical instruments and styles.☆459Updated 3 years ago
- A neural network for end-to-end music source separation☆225Updated 4 years ago
- Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes☆84Updated 3 years ago
- Unofficial Implementation of Google Deepmind's paper `Objects that Sound`☆83Updated 6 years ago
- Voice Converter Using CycleGAN and Non-Parallel Data☆526Updated last year
- TensorFlow implementation of "GANSynth: Adversarial Neural Audio Synthesis"☆66Updated 5 years ago
- Implementation of the Wave-U-Net for audio source separation☆844Updated last year
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)☆49Updated 5 years ago
- Deep Convolutional Neural Networks for Musical Source Separation☆472Updated 4 years ago
- This is a pytorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial ne…☆515Updated 5 years ago
- Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)☆349Updated 4 months ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆228Updated 2 years ago
- ☆403Updated last year
- OpenL3: Open-source deep audio and image embeddings☆468Updated last year
- Audio-Visual Speech Separation with Cross-Modal Consistency☆221Updated last year
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets☆380Updated 5 years ago
- Include some core functions and model to handle speech separation☆155Updated 3 years ago
- Listen to Look: Action Recognition by Previewing Audio (CVPR 2020)☆127Updated 3 years ago
- Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"☆111Updated 4 years ago
- A simplified PyTorch implementation of GANsynth☆81Updated 5 years ago
- Unsupervised Speech Decomposition Via Triple Information Bottleneck☆647Updated last month