OpenNLPLab / MMVAE-AVS
Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].
☆16 · Updated 11 months ago
Related projects:
- ☆19 · Updated 9 months ago
- Official repository of "Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer", AAAI 2024 ☆13 · Updated 5 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024 ☆25 · Updated 2 months ago
- Vision Transformers are Parameter-Efficient Audio-Visual Learners ☆82 · Updated last year
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer ☆52 · Updated 5 months ago
- [CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-… ☆31 · Updated last month
- ☆17 · Updated 6 months ago
- This repository contains the code for our CVPR 2022 paper on "Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and … ☆33 · Updated last year
- Official implementation for MGN ☆20 · Updated last year
- Code for dmrnet ☆10 · Updated last month
- The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning" ☆17 · Updated last year
- [CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learnin… ☆21 · Updated last year
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation" ☆21 · Updated last week
- Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling". ☆16 · Updated last month
- An official implementation of "Incomplete Multimodality-Diffused Emotion Recognition" in PyTorch. (NeurIPS 2023) ☆30 · Updated 9 months ago
- ICCV 2021 ☆31 · Updated 2 years ago
- [2023 TPAMI] Contrastive Positive Sample Propagation along the Audio-Visual Event Line ☆22 · Updated last year
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024 ☆32 · Updated 2 months ago
- PyTorch code for "Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes" (CVPR, 2022… ☆30 · Updated 2 months ago
- Official implementation for AVGN ☆32 · Updated last year
- Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022) ☆29 · Updated last year
- The repo for "Class-aware Sounding Objects Localization", TPAMI 2021. ☆29 · Updated 2 years ago
- PyTorch implementation of the paper [CVPR 2021] Distilling Audio-Visual Knowledge by Compositional Contrastive Learning ☆86 · Updated 3 years ago
- A Python implementation of Certifiable Robust Multi-modal Training ☆14 · Updated last month
- Official implementation for CIGN ☆14 · Updated last year
- Multimodal Learning Method MLA for CVPR 2024 ☆36 · Updated 3 months ago
- Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022) ☆14 · Updated last year
- ☆12 · Updated 6 months ago
- Multi-Scale Attention for Audio Question Answering ☆24 · Updated last year
- ☆9 · Updated 2 years ago