roudimit / MUSIC_dataset
MUSIC Dataset from The Sound of Pixels (ECCV '18)
☆119Updated 2 years ago
Related projects
Alternatives and complementary repositories for MUSIC_dataset
- 2.5D visual sound dataset☆92Updated 3 years ago
- Co-Separating Sounds of Visual Objects (ICCV 2019)☆94Updated last year
- Unofficial implementation of Google DeepMind's paper `Objects that Sound`☆83Updated 6 years ago
- 2.5D visual sound☆110Updated last year
- Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)☆68Updated 4 years ago
- VGGSound: A Large-scale Audio-Visual Dataset☆291Updated 3 years ago
- Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned S…☆52Updated 3 years ago
- Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)☆56Updated 5 years ago
- Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes☆84Updated 3 years ago
- ☆26Updated 3 years ago
- Learn an L3 embedding from audio/video pairs☆87Updated 2 years ago
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)☆62Updated 3 years ago
- PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos"☆40Updated 3 years ago
- Self-supervised VQ-VAE for One-Shot Music Style Transfer☆85Updated last year
- Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"☆111Updated 4 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch☆63Updated 6 years ago
- Content-Based Video-Music Retrieval using Soft Intra-Modal Structure Constraint☆61Updated 7 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆120Updated 2 years ago
- Companion code for ISMIR 2017 paper "Deep Salience Representations for $F_0$ Estimation in Polyphonic Music"☆84Updated 4 years ago
- A simplified PyTorch implementation of GANSynth☆81Updated 5 years ago
- The Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmente…☆106Updated 11 months ago
- AlignNet: A Unifying Approach to Audio-Visual Alignment (WACV 2020)☆31Updated 3 years ago
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)☆49Updated 5 years ago
- Code accompanying the paper "Semi-supervised adversarial audio source separation applied to singing voice extraction"☆83Updated 5 years ago
- TensorFlow implementation of "SoundNet".☆145Updated 6 years ago
- ☆26Updated 2 years ago
- 2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning…☆22Updated last year
- Wavenet Autoencoder for Unsupervised speech representation learning (after Chorowski, Jan 2019)☆174Updated 4 years ago
- Spatial Audio Generation☆100Updated last year
- Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch☆69Updated 3 years ago