shicaiwei123/ICCV2025-GDL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shicaiwei123/ICCV2025-GDL)

shicaiwei123 / ICCV2025-GDL

The official code for Boosting Multimodal Learning via Disentangled Gradient Learning

☆48

Alternatives and similar repositories for ICCV2025-GDL

Users that are interested in ICCV2025-GDL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GeWu-Lab / InfoReg_CVPR2025
View on GitHub
This is the repo for "Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition", CVPR2025.
☆24Dec 22, 2025Updated 7 months ago
shicaiwei123 / ICCV2025-ARL
View on GitHub
The official code for Improving Multimodal Learning via Imbalanced Learning
☆40Mar 26, 2026Updated 3 months ago
GeWu-Lab / BML_TPAMI2024
View on GitHub
The repo for "On-the-fly Modulation for Balanced Multimodal Learning", T-PAMI 2024
☆19Sep 29, 2024Updated last year
GeWu-Lab / BalanceBenchmark
View on GitHub
☆40Feb 23, 2025Updated last year
GeWu-Lab / awesome-balanced-multimodal-learning
View on GitHub
A curated list of balanced multimodal learning methods.
☆170Mar 26, 2026Updated 3 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
zrguo / CASP
View on GitHub
[AAAI 2025] Official PyTorch implementation of the paper "Bridging the Gap for Test-Time Multimodal Sentiment Analysis"
☆54Feb 21, 2025Updated last year
VisualAIKHU / Missing-AVQA
View on GitHub
Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024)
☆16Oct 29, 2024Updated last year
aimagelab / MissRAG
View on GitHub
[ICCV 2025] MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models
☆26May 12, 2026Updated 2 months ago
mhxu1998 / FlexCare
View on GitHub
KDD 2024 | FlexCare: Leveraging Cross-Task Synergy for Flexible Multimodal Healthcare Prediction
☆18Sep 4, 2024Updated last year
zzhhfut / CCNet-AAAI2025
View on GitHub
This repository contains code for AAAI2025 paper "Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal …
☆24Aug 18, 2025Updated 11 months ago
MengShen0709 / bmmal
View on GitHub
[ACMMM 2023] BMMAL: Towards Balanced Active Learning for Multimodal Classification
☆17Sep 25, 2023Updated 2 years ago
tub-cv-group / conclugen
View on GitHub
Official repository for our CVPR 2024 Workshop paper "Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition".
☆26Jan 10, 2025Updated last year
wxxv / MoMKE
View on GitHub
Code for "Leveraging Knowledge of Modality Experts for Incomplete Multimodal Learning" accepted by ACM Multimedia 2024
☆47Jan 15, 2025Updated last year
taco-group / DecAlign
View on GitHub
[ICLR 2026] DecAlign: Aligning Cross-Modal Semantics for Multimodal Foundation Models
☆106Jul 2, 2026Updated 3 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
juices6 / NLIN
View on GitHub
Natural Language-centered Inference Network for Multi-modal Fake News Detection
☆12Sep 23, 2024Updated last year
GeWu-Lab / OGM-GE_CVPR2022
View on GitHub
The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)
☆320Sep 22, 2025Updated 10 months ago
LuoMSen / KAN-MCP
View on GitHub
☆28Aug 3, 2025Updated 11 months ago
ChenxiLiu-HNU / CM2TS
View on GitHub
[IJCAI 2025] Official implementation of "Towards Cross-Modality Modeling for Time Series Analytics: A Survey in the LLM Era"
☆16Jun 23, 2025Updated last year
777pomingzi / Rethinking-PLM-in-RS
View on GitHub
Codebase for RecSys 2024 paper, The Elephant in the Room: Rethinking the Usage of Pre-trained Language Model in Sequential Recommendation
☆19Aug 7, 2024Updated last year
Xu107 / MMHCL
View on GitHub
[ACM TOMM'2025] "MMHCL: Multi-Modal Hypergraph Contrastive Learning for Recommendation"
☆31Aug 13, 2025Updated 11 months ago
ZihaoW123 / UniMM
View on GitHub
Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"
☆13May 12, 2023Updated 3 years ago
Indolent-Kawhi / EAGER-LLM
View on GitHub
A decoder-only llm-based generative recommendation framework that integrates endogenous and exogenous behavioral and semantic information…
☆16Mar 14, 2025Updated last year
SubmissionsIn / MVCAN
View on GitHub
Investigating and Mitigating the Side Effects of Noisy Views for Self-Supervised Clustering Algorithms in Practical Multi-View Scenarios
☆12Mar 21, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
GeWu-Lab / TSPM
View on GitHub
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
☆17Oct 25, 2024Updated last year
HKUST-KnowComp / VD-PCR
View on GitHub
Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"
☆10Nov 1, 2022Updated 3 years ago
sayarghoshroy / Summaformers
View on GitHub
Code for our Paper, 'Summaformers @ LaySumm 20, LongSumm 20' at EMNLP 2020, Scholarly Document Processing Workshop
☆12Feb 10, 2021Updated 5 years ago
luweihai / DAMMFND
View on GitHub
The code repository for the AAAI 2025 paper titled "DAMMFND: Domain-Aware Multimodal Multi-view Fake News Detection"
☆48May 5, 2025Updated last year
HHalva / snica
View on GitHub
Code for the paper 'Disentangling Identifiable Features from Noisy Data with Structured Nonlinear ICA' @ Neurips'21
☆21Feb 12, 2025Updated last year
maxischuh / BarlowDTI
View on GitHub
Accurate prediction of drug–target interactions in drug discovery.
☆11Dec 9, 2025Updated 7 months ago
GeWu-Lab / PSTP-Net
View on GitHub
☆17Aug 11, 2023Updated 2 years ago
HITsz-TMG / Cognitive-Visual-Language-Mapper
View on GitHub
The codes and datasets about our ACL 2024 Main Conference paper titled "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehens…
☆17Jan 24, 2025Updated last year
simpleshinobu / visdial-principles
View on GitHub
Implementation for CVPR 2020 Paper "Two Causal Principles for Improving Visual Dialog"
☆31Feb 19, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zrguo / CGGM
View on GitHub
[NeurIPS 2024] Official PyTorch implementation of the paper "Classifier-guided Gradient Modulation for Enhanced Multimodal Learning"
☆38Oct 10, 2024Updated last year
kaistmm / AVCD
View on GitHub
[NeurIPS 2025] AVCD: Mitigating Hallucinations in Audio-Visual Large Language Models through Contrastive Decoding
☆27Nov 3, 2025Updated 8 months ago
ImKeTT / ReSee
View on GitHub
[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation
☆12Dec 4, 2023Updated 2 years ago
mdswyz / DiCMoR
View on GitHub
An official implementation of "Distribution-Consistent Modal Recovering for Incomplete Multimodal Learning" in PyTorch. (ICCV 2023)
☆37Sep 28, 2023Updated 2 years ago
LuckyDaydreamer / SSLCL
View on GitHub
SSLCL: An Efficient Model-Agnostic Supervised Contrastive Learning Framework for Emotion Recognition in Conversations
☆15Jul 27, 2024Updated last year
YetZzzzzz / GLoMo
View on GitHub
Code for GLoMo: Global-Local Modality Fusion for Multimodal Sentiment Analysis, which is accepted by ACM MM 24.
☆39Dec 30, 2024Updated last year
LiShuailzn / Neurips-2025-EFB-EMVC
View on GitHub
[NeurIPS 2025 (Spotlight)] Evolutionary Multi-View Classification via Eliminating Individual Fitness Bias
☆19Dec 4, 2025Updated 7 months ago