GeWu-Lab / MMCosine_ICASSP23
The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"
☆24 Updated 2 years ago
Alternatives and similar repositories for MMCosine_ICASSP23
Users who are interested in MMCosine_ICASSP23 are comparing it to the repositories listed below
- [2022 TPAMI] Contrastive Positive Sample Propagation along the Audio-Visual Event Line ☆31 Updated 2 years ago
- ☆12 Updated 2 years ago
- Towards Long Form Audio-visual Video Understanding ☆14 Updated 8 months ago
- This repository contains the code for our CVPR 2022 paper on "Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and … ☆41 Updated 3 years ago
- Vision Transformers are Parameter-Efficient Audio-Visual Learners ☆106 Updated 2 years ago
- MUSIC-AVQA, CVPR2022 (ORAL) ☆90 Updated 2 years ago
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023]. ☆20 Updated last year
- A list of current Audio-Vision Multimodal with awesome resources (paper, application, data, review, survey, etc.).