ThomasWangY / 2024-AAAI-HPTView external linksLinks
Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)
☆73Feb 3, 2025Updated last year
Alternatives and similar repositories for 2024-AAAI-HPT
Users that are interested in 2024-AAAI-HPT are comparing it to the libraries listed below
Sorting:
- [CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".☆804Jul 24, 2023Updated 2 years ago
- [ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…☆284Sep 28, 2023Updated 2 years ago
- ☆32Mar 7, 2024Updated last year
- [ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models☆85May 24, 2024Updated last year
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆33Jan 3, 2024Updated 2 years ago
- Implementation of the paper LIMITR: Leveraging Local Information for Medical Image-Text Representation☆17Feb 8, 2024Updated 2 years ago
- ☆13Jul 17, 2024Updated last year
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"☆58Sep 3, 2024Updated last year
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023☆55Aug 19, 2023Updated 2 years ago
- Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts"☆103Mar 6, 2024Updated last year
- [TACL] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆16Nov 22, 2024Updated last year
- [ECCV 2024] Soft Prompt Generation for Domain Generalization☆31Oct 1, 2024Updated last year
- [NeurIPS 2024] WATT: Weight Average Test-Time Adaptation of CLIP☆56Sep 26, 2024Updated last year
- 【ICCV 2023】Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning & 【IJCV 2025】Diffusion-Enhanced Test-time Adap…☆70Jan 15, 2025Updated last year
- [ICCV 2023] Prompt-aligned Gradient for Prompt Tuning☆167Jul 15, 2023Updated 2 years ago
- LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections (NeurIPS 2023)☆29Dec 27, 2023Updated 2 years ago
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval☆20May 10, 2024Updated last year
- The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024 Strong Double Blind)☆20Nov 7, 2024Updated last year
- Official code for paper "Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models, ICML2024"☆27Feb 2, 2025Updated last year
- The efficient tuning method for VLMs☆80Mar 10, 2024Updated last year
- [CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"☆109Nov 24, 2025Updated 2 months ago
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆751Dec 1, 2025Updated 2 months ago
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆348Dec 14, 2025Updated 2 months ago
- 【CVPR'24 】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition☆38Apr 27, 2024Updated last year
- [ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models.☆99Oct 20, 2025Updated 3 months ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- Composed Video Retrieval☆62May 2, 2024Updated last year
- ☆120Feb 19, 2024Updated last year
- Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs (ECCV 2024)☆19Jul 15, 2024Updated last year
- ☆42Apr 7, 2024Updated last year
- Official Repository for CVPR 2024 Paper: "Large Language Models are Good Prompt Learners for Low-Shot Image Classification"☆41Jul 1, 2024Updated last year
- ☆22Dec 28, 2024Updated last year
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology☆12Jun 17, 2025Updated 7 months ago
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"☆13Aug 6, 2024Updated last year
- ☆14Jan 5, 2022Updated 4 years ago
- [MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"☆14Nov 1, 2024Updated last year
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023☆11Oct 5, 2023Updated 2 years ago
- A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes (WACV 2025)☆11Aug 11, 2025Updated 6 months ago
- A new multi-task learning framework using Vision Transformers☆11Jun 19, 2024Updated last year