roudimit / MUSIC_datasetLinks

MUSIC Dataset from The Sound of Pixels (ECCV '18)

☆129

Alternatives and similar repositories for MUSIC_dataset

Users that are interested in MUSIC_dataset are comparing it to the libraries listed below

Sorting:

facebookresearch / FAIR-Play
2.5D visual sound dataset
☆99Updated 3 years ago
rhgao / co-separation
Co-Separating Sounds of Visual Objects (ICCV 2019)
☆96Updated 2 years ago
SheldonTsui / SepStereo_ECCV2020
Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)
☆72Updated 4 years ago
facebookresearch / 2.5D-Visual-Sound
2.5D visual sound
☆114Updated 2 years ago
rohitrango / objects-that-sound
Unofficial Implementation of Google Deepmind's paper `Objects that Sound`
☆83Updated 7 years ago
ardasnck / learning_to_localize_sound_source
Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes
☆92Updated 8 months ago
hche11 / VGGSound
VGGSound: A Large-scale Audio-Visual Dataset
☆324Updated 3 years ago
Hangz-nju-cuhk / Vision-Infused-Audio-Inpainter-VIAI
Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)
☆57Updated 5 years ago
afourast / avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
☆113Updated 4 years ago
pedro-morgado / AVSpatialAlignment
☆29Updated 3 years ago
hangzhaomit / Sound-of-Pixels
Codebase for ECCV18 "The Sound of Pixels"
☆385Updated 3 years ago
facebookresearch / EasyComDataset
The Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmente…
☆120Updated last year
SheldonTsui / PseudoBinaural_CVPR2021
Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)
☆65Updated 4 years ago
hche11 / Localizing-Visual-Sounds-the-Hard-Way
Localizing Visual Sounds the Hard Way
☆81Updated 3 years ago
DTaoo / Discriminative-Sounding-Objects-Localization
Code for Discriminative Sounding Objects Localization (NeurIPS 2020)
☆58Updated 3 years ago
dharwath / DAVEnet-pytorch
Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch
☆65Updated 6 years ago
v-iashin / SparseSync
Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)
☆51Updated last year
shlizee / Audeo
☆28Updated 4 years ago
sangho-vision / acav100m
ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. In ICCV, 2021.
☆58Updated 3 years ago
YuanGongND / uavm
Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models".
☆55Updated 2 years ago
EGO4D / audio-visual
☆66Updated 2 years ago
rhgao / Deep-MIML-Network
Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)
☆51Updated 5 years ago
ekazakos / auditory-slow-fast
Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch
☆74Updated 3 years ago
PeihaoChen / regnet
Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned S…
☆53Updated 4 years ago
audio-captioning / clotho-dataset
Python code for handling the Clotho dataset.
☆81Updated 4 years ago
facebookresearch / AVID-CMA
Audio Visual Instance Discrimination with Cross-Modal Agreement
☆129Updated 3 years ago
YapengTian / CCOL-CVPR21
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation
☆25Updated 3 years ago
akoepke / audio-retrieval-benchmark
Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".
☆51Updated 3 weeks ago
zfang399 / AlignNet
AlignNet: A Unifying Approach to Audio-Visual Alignment (WACV 2020)
☆33Updated 4 years ago
YapengTian / AVE-ECCV18
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
☆186Updated 4 years ago