The official implementation of the paper "Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model". If you find our code or paper useful, please cite it.
☆50 · Jul 18, 2023 · Updated 2 years ago
Alternatives and similar repositories for DPT
Users that are interested in DPT are comparing it to the libraries listed below
- ☆61 · May 2, 2025 · Updated 10 months ago
- Official PyTorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?' (ICLR 2024) ☆13 · Mar 8, 2024 · Updated last year
- [ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models ☆84 · May 24, 2024 · Updated last year
- Model calibration in CLIP Adapters ☆19 · Aug 19, 2024 · Updated last year
- ☆105 · Dec 7, 2023 · Updated 2 years ago
- Official Implementation of "Towards Open-Vocabulary Semantic Segmentation without Semantic Labels" (NeurIPS 2024) ☆53 · Oct 7, 2024 · Updated last year
- [NeurIPS 2024] WATT: Weight Average Test-Time Adaptation of CLIP ☆56 · Sep 26, 2024 · Updated last year
- Noise Contrastive Test-Time Training ☆12 · Mar 11, 2024 · Updated last year
- Official code for "IT³: Idempotent Test-Time Training" (ICML 2025) ☆14 · Jun 25, 2025 · Updated 8 months ago
- [AAAI 2025] Official Implementation of I-HallA v1.0 ☆13 · Feb 2, 2025 · Updated last year
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models. ☆27 · Oct 27, 2023 · Updated 2 years ago
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization". ☆32 · Mar 10, 2025 · Updated 11 months ago
- The official PyTorch implementation of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models". ☆95 · Apr 24, 2025 · Updated 10 months ago
- An Enhanced CLIP Framework for Learning with Synthetic Captions ☆40 · Apr 18, 2025 · Updated 10 months ago
- This repository contains the code for our CVPR 2024 paper, ☆15 · Aug 27, 2024 · Updated last year
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training ☆13 · Feb 15, 2025 · Updated last year
- ☆13 · Jul 17, 2024 · Updated last year
- ❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119 ☆1,215 · Sep 2, 2023 · Updated 2 years ago
- ☆36 · Nov 4, 2022 · Updated 3 years ago
- [WACV 2026] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval ☆13 · Sep 18, 2025 · Updated 5 months ago
- [CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation ☆22 · Nov 17, 2025 · Updated 3 months ago
- [TACL] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study ☆16 · Nov 22, 2024 · Updated last year
- Code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720 ☆56 · Jun 5, 2024 · Updated last year
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models ☆56 · Jul 9, 2024 · Updated last year
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality ☆21 · Oct 8, 2024 · Updated last year
- Official PyTorch implementation of our ICCV2023 paper “When Prompt-based Incremental Learning Does Not Meet Strong Pretraining” ☆16 · Jan 8, 2024 · Updated 2 years ago
- AAPL: Adding Attributes to Prompt Learning for Vision-Language Models (CVPRw 2024) ☆34 · May 8, 2024 · Updated last year
- Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22) ☆2,179 · May 20, 2024 · Updated last year
- ☆14 · Oct 31, 2022 · Updated 3 years ago
- ☆16 · May 26, 2023 · Updated 2 years ago
- Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete Labels [CVPR 2023] ☆15 · Sep 23, 2023 · Updated 2 years ago
- [NAACL'25] Contains code and documentation for our VANE-Bench paper. ☆17 · Aug 19, 2025 · Updated 6 months ago
- [CVPR 2024] Official Repository for "Efficient Test-Time Adaptation of Vision-Language Models" ☆115 · Jul 15, 2024 · Updated last year
- [NeurIPS 2023] Generalized Logit Adjustment ☆39 · Apr 21, 2024 · Updated last year
- ☆21 · Oct 9, 2025 · Updated 4 months ago
- [ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models ☆19 · Mar 25, 2025 · Updated 11 months ago
- [CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering ☆20 · Sep 21, 2024 · Updated last year
- Repo of NeurIPS23 ☆18 · Oct 25, 2023 · Updated 2 years ago
- VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition ☆69 · Oct 1, 2022 · Updated 3 years ago