The official implementation of paper Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model. If you find our code or paper useful, please give us a citation.
☆51Jul 18, 2023Updated 2 years ago
Alternatives and similar repositories for DPT
Users that are interested in DPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch implementation of our AAAI 2026 paper, "YOLO-IOD: Towards Real Time Incremental Object Detection"☆41Apr 14, 2026Updated 2 months ago
- [ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models☆86May 24, 2024Updated 2 years ago
- ☆61May 2, 2025Updated last year
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training☆13Feb 15, 2025Updated last year
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)☆13Mar 8, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- AAPL: Adding Attributes to Prompt Learning for Vision-Language Models (CVPRw 2024)☆34May 8, 2024Updated 2 years ago
- [CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".☆818Jul 24, 2023Updated 2 years ago
- Model calibration in CLIP Adapters☆20Aug 19, 2024Updated last year
- ❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119☆1,235Sep 2, 2023Updated 2 years ago
- ☆110Dec 7, 2023Updated 2 years ago
- Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)☆2,212May 20, 2024Updated 2 years ago
- UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)☆12May 26, 2024Updated 2 years ago
- code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720☆57Jun 5, 2024Updated 2 years ago
- Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete Labels [CVPR 2023]☆14Sep 23, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official Repository for CVPR 2024 Paper: "Large Language Models are Good Prompt Learners for Low-Shot Image Classification"☆45Jul 1, 2024Updated 2 years ago
- VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition☆69Oct 1, 2022Updated 3 years ago
- [NeurIPS 2023] Generalized Logit Adjustment☆40Apr 21, 2024Updated 2 years ago
- Official Implementation of "Towards Open-Vocabulary Semantic Segmentation without Semantic Labels" (NeurIPS 2024)☆56Oct 7, 2024Updated last year
- Official PyTorch implementation of our ICCV2023 paper “When Prompt-based Incremental Learning Does Not Meet Strong Pretraining”☆16Jan 8, 2024Updated 2 years ago
- [NAACL'25] Contains code and documentation for our VANE-Bench paper.☆24Aug 19, 2025Updated 10 months ago
- [NeurIPS 2024] WATT: Weight Average Test-Time Adaptation of CLIP☆58Sep 26, 2024Updated last year
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆22Oct 8, 2024Updated last year
- ☆579Jul 19, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆36Nov 4, 2022Updated 3 years ago
- Global Reasoning unit (GloRe)☆19May 20, 2019Updated 7 years ago
- [ACL 2023] Delving into the Openness of CLIP☆24Jan 11, 2023Updated 3 years ago
- The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".☆100Apr 24, 2025Updated last year
- ☆56Oct 5, 2022Updated 3 years ago
- ☆14Apr 7, 2024Updated 2 years ago
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models☆57Jul 9, 2024Updated last year
- ☆14Oct 31, 2022Updated 3 years ago
- ☆678Nov 28, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- An Enhanced CLIP Framework for Learning with Synthetic Captions☆40Apr 18, 2025Updated last year
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆177Dec 14, 2023Updated 2 years ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- [TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆16Nov 22, 2024Updated last year
- [AAAI 2025] Official Implementation of I-HallA v1.0☆16Feb 2, 2025Updated last year
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆76May 27, 2023Updated 3 years ago
- Repo of NeurIPS23☆17Oct 25, 2023Updated 2 years ago