VisualAIKHU / Missing-AVQAView external linksLinks
Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024)
☆15Oct 29, 2024Updated last year
Alternatives and similar repositories for Missing-AVQA
Users that are interested in Missing-AVQA are comparing it to the libraries listed below
Sorting:
- Official Repository for "Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization" (ACM MM 2023)☆18Nov 14, 2023Updated 2 years ago
- Official Repository for "Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge" (CVPR 2024)☆13Sep 1, 2024Updated last year
- Official Repository for "Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection" (AAAI …☆14Mar 1, 2025Updated 11 months ago
- ☆17Aug 11, 2023Updated 2 years ago
- Official Repository for "Multispectral Pedestrian Detection with Sparsely Annotated Label" (AAAI 2025)☆29Apr 28, 2025Updated 9 months ago
- The repo for "On-the-fly Modulation for Balanced Multimodal Learning", T-PAMI 2024☆19Sep 29, 2024Updated last year
- ☆25Apr 16, 2025Updated 9 months ago
- An official implementation of "Incomplete Multimodality-Diffused Emotion Recognition" in PyTorch. (NeurIPS 2023)☆62Dec 5, 2023Updated 2 years ago
- ☆34Jul 25, 2024Updated last year
- ☆27Aug 2, 2023Updated 2 years ago
- [CVPR 2025] 🔥 Official impl. of "Audio-Visual Instance Segmentation".☆42Jun 5, 2025Updated 8 months ago
- ☆40Apr 16, 2024Updated last year
- A Lightweight Multi-modality Image Segmentation Network via Domain Adaptation using Gradient Magnitude and Shape Constraint☆10Apr 3, 2023Updated 2 years ago
- MUSIC-AVQA, CVPR2022 (ORAL)☆94Dec 30, 2022Updated 3 years ago
- ☆11Aug 20, 2025Updated 5 months ago
- [IEEE TIP] Offical implementation for the work "BadCM: Invisible Backdoor Attack against Cross-Modal Learning".☆14Aug 30, 2024Updated last year
- DL Backtrace is a new explainablity technique for deep learning models that works for any modality and model type.☆20Updated this week
- Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information☆11Sep 28, 2023Updated 2 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆18May 8, 2025Updated 9 months ago
- ☆12Apr 19, 2024Updated last year
- This is the repo for "Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition", CVPR2025.☆20Dec 22, 2025Updated last month
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models.☆14May 26, 2025Updated 8 months ago
- [NeurIPS 2025] TopoPoint: Enhance Topology Reasoning via Endpoint Detection in Autonomous Driving☆29Dec 13, 2025Updated 2 months ago
- ☆13May 21, 2024Updated last year
- Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025)☆12Mar 6, 2025Updated 11 months ago
- ☆16Aug 15, 2024Updated last year
- Official repository for ACM Multimedia'24 paper "MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube a…☆17Aug 11, 2024Updated last year
- [ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos"☆22Nov 23, 2025Updated 2 months ago
- Crossmodal Translation based Meta Weight Adaption for Robust Image-Text Sentiment Analysis☆15May 16, 2024Updated last year
- ☆11Mar 24, 2025Updated 10 months ago
- Paper list for accleration of transformers☆14Jul 1, 2023Updated 2 years ago
- Visual Speech Recognition For Low-Resource Languages with Automatic Labels (ICASSP 2024)☆16Mar 17, 2025Updated 10 months ago
- [ICCV 2025] The official pytorch implement of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".☆22Oct 28, 2025Updated 3 months ago
- SwinTransformer for Tensorflow2☆12Jul 7, 2022Updated 3 years ago
- Official Repository for "MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection" (ECCV 2024)☆60Oct 18, 2024Updated last year
- A dataset and CLIP baseline for unrepresentative news thumbnail detection (ACL 2022 workshop)☆12May 26, 2022Updated 3 years ago
- ☆13Apr 2, 2025Updated 10 months ago
- Official Pytorch Implementation for "TextToucher: Fine-Grained Text-to-Touch Generation" (AAAI 2025)☆18Jan 28, 2026Updated 2 weeks ago
- ☆14Dec 31, 2024Updated last year