JindongGu / Awesome-Prompting-on-Vision-Language-ModelLinks

This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.

☆500

Alternatives and similar repositories for Awesome-Prompting-on-Vision-Language-Model

Users that are interested in Awesome-Prompting-on-Vision-Language-Model are comparing it to the libraries listed below

Sorting:

ttengwang / Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based paper in computer vision and vision-language learning.
☆925Updated last year
DirtyHarryLYL / LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
☆872Updated 8 months ago
zhengli97 / Awesome-Prompt-Adapter-Learning-for-VLMs
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
☆701Updated 2 months ago
awaisrauf / Awesome-CV-Foundational-Models
☆532Updated last year
muzairkhattak / multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
☆784Updated 2 years ago
showlab / Awesome-MLLM-Hallucination
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
☆895Updated last month
jianghaojun / Awesome-Parameter-Efficient-Transfer-Learning
A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.
☆410Updated last year
Computer-Vision-in-the-Wild / CVinW_Readings
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
☆1,343Updated last year
gaopengcuhk / Tip-Adapter
☆643Updated last year
gaopengcuhk / CLIP-Adapter
☆555Updated 3 years ago
tsb0601 / MMVP
☆355Updated last year
friedrichor / Awesome-Multimodal-Papers
A curated list of awesome Multimodal studies.
☆286Updated 2 weeks ago
Yutong-Zhou-cv / Awesome-Multimodality
A Survey on multimodal learning research.
☆333Updated 2 years ago
KMnP / vpt
❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119
☆1,184Updated 2 years ago
zhengli97 / PromptKD
[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"
☆336Updated 2 months ago
ttengwang / Awesome_Long_Form_Video_Understanding
Awesome papers & datasets specifically focused on long-term videos.
☆325Updated last month
haokunwen / Awesome-Composed-Image-Retrieval
Collection of Composed Image Retrieval (CIR) papers.
☆272Updated last week
SunzeY / AlphaCLIP
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
☆852Updated 3 months ago
muzairkhattak / PromptSRC
[ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…
☆279Updated 2 years ago
Atomic-man007 / Awesome_Multimodel_LLM
Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Mod…
☆346Updated 7 months ago
zjysteven / VLM-Visualizer
Visualizing the attention of vision-language models
☆252Updated 8 months ago
yossigandelsman / clip_text_span
official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"
☆232Updated 5 months ago
DAMO-NLP-SG / VCD
[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
☆339Updated last year
NishilBalar / Awesome-LVLM-Hallucination
up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources
☆208Updated last month
deepcs233 / Visual-CoT
[Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …
☆396Updated 10 months ago
xmed-lab / CLIP_Surgery
[Pattern Recognition 25] CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
☆447Updated 8 months ago
zhaochen0110 / Awesome_Think_With_Images
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…
☆1,094Updated last month
zli12321 / Vision-Language-Models-Overview
A most Frontend Collection and survey of vision-language model papers, and models GitHub repository. Continuous updates.
☆433Updated 2 weeks ago
Charles-Xie / awesome-described-object-detection
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…
☆327Updated last week
mertyg / vision-language-models-are-bows
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR …
☆286Updated 2 years ago