lihong2303 / AGMLinks
[ICCV2023] The repo for "Boosting Multi-modal Model Performance with Adaptive Gradient Modulation".
☆25Updated last year
Alternatives and similar repositories for AGM
Users that are interested in AGM are comparing it to the libraries listed below
Sorting:
- [ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation☆45Updated last year
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆42Updated 11 months ago
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024☆52Updated 7 months ago
- PMR: Prototypical Modal Rebalance for Multimodal Learning☆38Updated 2 years ago
- A python implement for Certifiable Robust Multi-modal Training☆19Updated 10 months ago
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆41Updated 7 months ago
- ☆23Updated 2 years ago
- ICML2024-ReconBoost: Boosting Can Achieve Modality Reconcilement☆23Updated last month
- The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024☆24Updated 10 months ago
- The official implementation of ECCV2024 paper "Facial Affective Behavior Analysis with Instruction Tuning"☆26Updated 5 months ago
- ☆36Updated 2 years ago
- Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution☆50Updated last year
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆81Updated last year
- ☆37Updated 11 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆74Updated 4 months ago
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023☆53Updated last year
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆35Updated 2 months ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆43Updated 5 months ago
- Code for dmrnet☆24Updated 2 weeks ago
- [NeurIPS 2023] Generalized Logit Adjustment☆37Updated last year
- Official code for "Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models" (TCSVT'2023)☆28Updated last year
- ☆61Updated last year
- ☆27Updated 2 years ago
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆166Updated last year
- [NeurIPS 2024] MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models☆65Updated last month
- Code for our ICML'24 on multimodal dataset distillation☆37Updated 7 months ago
- The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024 Strong Double Blind)☆20Updated 7 months ago
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).☆45Updated 2 months ago
- The official implementation of paper "Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval" accepted by NeurIPS…☆26Updated last year
- [CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.☆28Updated last year