lihong2303 / AGM

[ICCV2023] The repo for "Boosting Multi-modal Model Performance with Adaptive Gradient Modulation".

☆25

Alternatives and similar repositories for AGM

Users who are interested in AGM are comparing it to the repositories listed below

zihuixue / MFH
[ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation
☆45 Updated last year
GeWu-Lab / MMPareto_ICML2024
The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024
☆42 Updated 11 months ago
GeWu-Lab / Valuate-and-Enhance-Multimodal-Cooperation
The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024
☆52 Updated 7 months ago
fanyunfeng-bit / Modal-Imbalance-PMR
PMR: Prototypical Modal Rebalance for Multimodal Learning
☆38 Updated 2 years ago
GeWu-Lab / Certifiable-Robust-Multi-modal-Training
A Python implementation of Certifiable Robust Multi-modal Training
☆19 Updated 10 months ago
knightyxp / DGL
[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.
☆41 Updated 7 months ago
UniAdapter / UniAdapter
☆23 Updated 2 years ago
huacong / ReconBoost
ICML2024-ReconBoost: Boosting Can Achieve Modality Reconcilement
☆23 Updated last month
GeWu-Lab / Diagnosing_Relearning_ECCV2024
The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024
☆24 Updated 10 months ago
JackYFL / EmoLA
The official implementation of ECCV2024 paper "Facial Affective Behavior Analysis with Instruction Tuning"
☆26 Updated 5 months ago
IIGROUP / MAP
☆36 Updated 2 years ago
YingWANGG / M2IB
Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution
☆50 Updated last year
ZhengYu518 / VL-Mamba
Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"
☆81 Updated last year
yaolinli / DeCo
☆37 Updated 11 months ago
ThomasWangY / 2024-AAAI-HPT
Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)
☆74 Updated 4 months ago
mlvlab / RPO
Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023
☆53 Updated last year
Ruiyang-061X / VL-Uncertainty
🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".
☆35 Updated 2 months ago
zycheiheihei / Transferable-Visual-Prompting
[CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…
☆43 Updated 5 months ago
shicaiwei123 / ECCV2024-DMRNet
Code for DMRNet
☆24 Updated 2 weeks ago
BeierZhu / GLA
[NeurIPS 2023] Generalized Logit Adjustment
☆37 Updated last year
machengcheng2016 / Subspace-Prompt-Learning
Official code for "Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models" (TCSVT'2023)
☆28 Updated last year
yuxiaochen1103 / FDT
☆61 Updated last year
xu5zhao / BiCro
☆27 Updated 2 years ago
CHENGY12 / PLOT
[ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models
☆166 Updated last year
JiuTian-VL / MoME
[NeurIPS 2024] MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models
☆65 Updated last month
silicx / LoRS_Distill
Code for our ICML'24 paper on multimodal dataset distillation
☆37 Updated 7 months ago
deep-real / DEAL
The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024 Strong Double Blind)
☆20 Updated 7 months ago
Ruiyang-061X / Awesome-MLLM-Uncertainty
✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).
☆45 Updated 2 months ago
leolee99 / PAU
The official implementation of paper "Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval" accepted by NeurIPS…
☆26 Updated last year
Zi-hao-Wei / Efficient-Vision-Language-Pre-training-by-Cluster-Masking
[CVPR 2024] Improving language-visual pretraining efficiency by performing cluster-based masking on images.
☆28 Updated last year