DTaoo / Simplified_DMCLinks

A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)

☆19

Alternatives and similar repositories for Simplified_DMC

Users that are interested in Simplified_DMC are comparing it to the libraries listed below

Sorting:

zjsong / SSPL
PyTorch code for "Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes" (CVPR, 2022…
☆32Updated 11 months ago
DTaoo / DMC
Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)
☆15Updated 5 years ago
W-Wu / DEER
☆11Updated last year
hxixixh / mix-and-localize
☆20Updated last year
alvinliu0 / Visual-Sound-Localization-in-the-Wild
Code for Visual Sound Localization in the Wild by Cross-Modal Interference Erasing (AAAI 2022).
☆29Updated 3 years ago
OpenNLPLab / MMVAE-AVS
Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].
☆19Updated 9 months ago
OpenNLPLab / FNAC_AVL
[CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learnin…
☆25Updated 2 years ago
DTaoo / Discriminative-Sounding-Objects-Localization
Code for Discriminative Sounding Objects Localization (NeurIPS 2020)
☆57Updated 3 years ago
YapengTian / CCOL-CVPR21
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation
☆25Updated 3 years ago
stoneMo / MGN
Official implementation for MGN
☆20Updated 2 years ago
SheldonTsui / SepStereo_ECCV2020
Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)
☆72Updated 4 years ago
YapengTian / AV-Robustness-CVPR21
Can audio-visual integration strengthen robustness under multimodal attacks?
☆28Updated 3 years ago
tomp11 / metric_learning
Metric Learning (npair loss & angular loss) on mnist and Visualizing by t_SNE
☆35Updated 2 years ago
YapengTian / AVVP-ECCV20
Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight)
☆88Updated 11 months ago
IFICL / SLfM
Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation
☆39Updated last year
shanwangshan / TAU-urban-audio-visual-scenes
☆10Updated 3 years ago
weiguoPian / AV-CIL_ICCV2023
☆25Updated 8 months ago
Yu-Wu / Modaily-Aware-Audio-Visual-Video-Parsing
Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing
☆24Updated 3 years ago
caffeinism / FiLM-pytorch
PyTorch implementation of FiLM: Visual Reasoning with a General Conditioning Layer
☆58Updated 5 years ago
yunyikristy / CM-ACC
Cross-model active contrastive coding
☆22Updated 4 years ago
dkurzend / ClipClap-GZSL
Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models
☆17Updated last year
stoneMo / EZ-VSL
Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)
☆34Updated 2 years ago
HimangiM / RepLAI
Self-supervised algorithm for learning representations from ego-centric video data. Code is tested on EPIC-Kitchens-100 and Ego4D in PyTo…
☆12Updated 2 years ago
xiaobai1217 / DomainAdaptation
CVPR2022
☆21Updated 2 years ago
DTaoo / Multimodal-Aerial-Scene-Recognition
Code for <Cross-Task Transfer for Geotagged Audiovisual Aerial Scene Recognition> (ECCV 2020)
☆36Updated 4 years ago
yyf17 / SAAVN
SAAVN Code release for paper "Sound Adversarial Audio-Visual Navigation,ICLR2022" (In PyTorch)
☆19Updated 2 years ago
salesforce / CAST
☆19Updated last month
HumamAlwassel / XDC
Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)
☆90Updated 2 years ago
baopj / DenseEventsGrounding
☆17Updated last year
stoneMo / SLAVC
Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022)
☆17Updated 2 years ago