Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.
☆33Jul 21, 2023Updated 2 years ago
Alternatives and similar repositories for pi-Tuning
Users that are interested in pi-Tuning are comparing it to the libraries listed below
Sorting:
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆32May 15, 2023Updated 2 years ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Jun 20, 2023Updated 2 years ago
- [NeurIPS 2022] VisDA 2022 Challenge Toolkit☆20Oct 1, 2022Updated 3 years ago
- ☆23Aug 17, 2024Updated last year
- [ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensi…☆22Jun 12, 2025Updated 9 months ago
- Code for the ICCV 2023 paper "Benchmarking Low-Shot Robustness to Natural Distribution Shifts"☆11Jan 21, 2024Updated 2 years ago
- Un-*** 50 billions multimodality dataset☆23Sep 14, 2022Updated 3 years ago
- [TPAMI] Searching prompt modules for parameter-efficient transfer learning.☆238Dec 8, 2023Updated 2 years ago
- [CVPR 2026 (Findings) 🔥🔥] Self Evolving Large Multimodal Models with Continuous Rewards☆21Mar 5, 2026Updated 2 weeks ago
- [ICCV 2023] Prompt-aligned Gradient for Prompt Tuning☆168Jul 15, 2023Updated 2 years ago
- ☆37May 7, 2023Updated 2 years ago
- ☆24Jun 18, 2025Updated 9 months ago
- [ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization☆30Apr 7, 2023Updated 2 years ago
- ☆40Dec 16, 2025Updated 3 months ago
- ☆66Feb 4, 2026Updated last month
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)"☆15Feb 13, 2023Updated 3 years ago
- On-Device Domain Generalization☆46Nov 9, 2022Updated 3 years ago
- Source code for the NAACL 2021 paper: "Distantly Supervised Relation Extraction with Sentence Reconstruction and Knowledge Base Priors"☆12Jul 15, 2021Updated 4 years ago
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale☆213Feb 27, 2024Updated 2 years ago
- ☆20Oct 19, 2023Updated 2 years ago
- AnyDoor: Test-Time Backdoor Attacks on Multimodal Large Language Models☆60Apr 8, 2024Updated last year
- ☆19Feb 2, 2026Updated last month
- [ECCV'22 Poster] Explicit Image Caption Editing☆22Nov 30, 2022Updated 3 years ago
- ☆27Mar 20, 2023Updated 3 years ago
- A curated list of papers and resources for text-to-image evaluation.☆30Sep 6, 2023Updated 2 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"☆12Aug 17, 2021Updated 4 years ago
- A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition.☆10Jul 15, 2020Updated 5 years ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆17Dec 17, 2025Updated 3 months ago
- Generalized and Incremental Few-Shot Learning by Explicit Learning and Calibration without Forgetting, (ICCV'21)☆14Aug 4, 2022Updated 3 years ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆36Oct 3, 2024Updated last year
- Sample LaTex file for HKU PhD thesis.☆27Mar 16, 2022Updated 4 years ago
- [ICLR 2021] Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization☆42May 2, 2021Updated 4 years ago
- [ICCV 2023]The PyTorch implementation of TL-Align: Token-Label Alignment for Vision Transformers.☆23Jul 16, 2023Updated 2 years ago
- ☆14May 4, 2024Updated last year
- Official codes for ConMIM (ICLR 2023)☆58Feb 8, 2023Updated 3 years ago
- ☆14May 3, 2022Updated 3 years ago
- A curated list of zero-shot captioning papers☆24Aug 26, 2023Updated 2 years ago
- ☆16Dec 24, 2021Updated 4 years ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward☆93Aug 8, 2025Updated 7 months ago