shicaiwei123 / ICCV2025-GDL
The official code for Boosting Multimodal Learning via Disentangled Gradient Learning
☆33 · Updated 2 months ago
Alternatives and similar repositories for ICCV2025-GDL
Users who are interested in ICCV2025-GDL are comparing it to the repositories listed below.
- Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024) ☆15 · Updated last year
- TCL-MAP is a powerful method for multimodal intent recognition (AAAI 2024) ☆55 · Updated 2 years ago
- Uncertainty-Guided Noisy Correspondence Learning for Efficient Cross-Modal Matching (ACM SIGIR 2024, Pytorch Code) ☆23 · Updated 11 months ago
- Code for dmrnet ☆29 · Updated 6 months ago
- [CVPR 2025] Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation ☆80 · Updated last month
- [AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition ☆72 · Updated last year
- [CVPR'25] EMOE: Modality-Specific Enhanced Dynamic Emotion Experts ☆107 · Updated 6 months ago
- Official PyTorch repository for GRAM ☆112 · Updated 8 months ago
- A comprehensive survey of Composed Multi-modal Retrieval (CMR), including Composed Image Retrieval (CIR) and Composed Video Retrieval (CV… ☆78 · Updated last week
- [CVPR 2025] Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation ☆26 · Updated 6 months ago
- Latest Papers, Codes and Datasets on VTG-LLMs. ☆77 · Updated 2 months ago
- The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL) ☆306 · Updated 4 months ago
- Code for the paper 'Dynamic Multimodal Fusion' ☆122 · Updated 2 years ago
- A curated list of balanced multimodal learning methods. ☆152 · Updated 2 weeks ago
- Official Implementation of GENIUS: A Generative Framework for Universal Multimodal Search, CVPR 2025 ☆47 · Updated 5 months ago
- [ICML 2024] Official implementation for "Predictive Dynamic Fusion." ☆70 · Updated last year
- ICML2024-ReconBoost: Boosting Can Achieve Modality Reconcilement ☆28 · Updated 8 months ago
- Codes of the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval ☆27 · Updated 9 months ago
- The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024 ☆49 · Updated 3 months ago
- The official code for Improving Multimodal Learning via Imbalanced Learning ☆21 · Updated last month
- Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023) ☆66 · Updated last year
- Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23 ☆226 · Updated 2 years ago
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation ☆33 · Updated last year
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval ☆44 · Updated last year
- [CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-… ☆40 · Updated 9 months ago
- Official pytorch repository for "TR-DETR: Task-Reciprocal Transformer for Joint Moment Retrieval and Highlight Detection" (AAAI 2024 Pape… ☆54 · Updated 11 months ago
- Universal Video Temporal Grounding with Generative Multi-modal Large Language Models ☆44 · Updated 2 months ago
- A Python implementation of Certifiable Robust Multi-modal Training ☆19 · Updated 7 months ago