yanbeic / CCLLinks

PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning

☆89

Alternatives and similar repositories for CCL

Users that are interested in CCL are comparing it to the libraries listed below

Sorting:

HumamAlwassel / XDC
Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)
☆90Updated 2 years ago
ninatu / everything_at_once
Official implementation of "Everything at Once - Multi-modal Fusion Transformer for Video Retrieval." CVPR 2022
☆109Updated 3 years ago
YapengTian / AVVP-ECCV20
Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight)
☆88Updated last year
jasongief / PSP_CVPR_2021
[2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line
☆42Updated 3 years ago
GeWu-Lab / MUSIC-AVQA
MUSIC-AVQA, CVPR2022 (ORAL)
☆87Updated 2 years ago
ExplainableML / AVCA-GZSL
This repository contains the code for our CVPR 2022 paper on "Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and …
☆37Updated 2 years ago
sangho-vision / avbert
☆31Updated 3 years ago
yunyikristy / CM-ACC
Cross-model active contrastive coding
☆22Updated 4 years ago
roudimit / AVLnet
Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.
☆52Updated 3 years ago
yangxuntu / lxmertcatt
☆74Updated 2 years ago
amazon-science / crossmodal-contrastive-learning
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021
☆64Updated 3 years ago
brian7685 / Multimodal-Clustering-Network
ICCV 2021
☆33Updated 3 years ago
GeWu-Lab / CSOL_TPAMI2021
The repo for "Class-aware Sounding Objects Localization", TPAMI 2021.
☆29Updated 3 years ago
boheumd / A2Summ
The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)
☆78Updated 2 years ago
DTaoo / Discriminative-Sounding-Objects-Localization
Code for Discriminative Sounding Objects Localization (NeurIPS 2020)
☆58Updated 3 years ago
akashe / Multimodal-action-recognition
Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.
☆73Updated 4 years ago
GenjiB / LAVISH
Vision Transformers are Parameter-Efficient Audio-Visual Learners
☆102Updated last year
zhixiongz / CLIP4CMR
A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval
☆42Updated 3 years ago
YapengTian / AVE-ECCV18
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
☆186Updated 4 years ago
andreineculai / MPC
☆24Updated 3 years ago
Yu-Wu / Modaily-Aware-Audio-Visual-Video-Parsing
Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing
☆24Updated 3 years ago
IBM / AdaMML
Official implementation of AdaMML. https://arxiv.org/abs/2105.05165.
☆51Updated 3 years ago
ExplainableML / TCAF-GZSL
This repository contains the code for our ECCV 2022 paper "Temporal and cross-modal attention for audio-visual zero-shot learning"
☆24Updated 2 years ago
ioanacroi / qb-norm
Cross Modal Retrieval with Querybank Normalisation
☆55Updated last year
wenz116 / DRFT
End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021
☆18Updated 3 years ago
ed-fish / spatio-temporal-contrastive-film
Unsupervised Film Genre Classification using Spatio-Temporal Contrastive Learning
☆32Updated 2 years ago
liudaizong / CSMGAN
Code for ACM MM2020 paper: Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization
☆34Updated 4 years ago
BestJuly / IIC
Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'
☆111Updated 4 years ago
naver-ai / pcme
Official Pytorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021)
☆133Updated last year
crodriguezo / TMLGA
Repository of proposal-free temporal moment localization work
☆33Updated last year