hsb1357173526 / Dynamic_Visual_PromptingLinks

☆5

Alternatives and similar repositories for Dynamic_Visual_Prompting

Users that are interested in Dynamic_Visual_Prompting are comparing it to the libraries listed below

Sorting:

thunlp / PEVL
Source code for EMNLP 2022 paper “PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models”
☆48Updated 2 years ago
thunlp / CPT
Colorful Prompt Tuning for Pre-trained Vision-Language Models
☆49Updated 2 years ago
SivanDoveh / TSVLC
Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models
☆46Updated last year
zhangxi1997 / VQACL
VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)
☆39Updated last year
bruceyo / V-PETL
Towards a Unified View on Visual Parameter-Efficient Transfer Learning
☆26Updated 2 years ago
XLiu443 / Tem-adapter
[ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
☆37Updated last year
eric-ai-lab / CPL
Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"
☆34Updated 2 years ago
Yuqifan1117 / HalluciDoctor
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)
☆45Updated last year
LeapLabTHU / Cross-Modal-Adapter
[arXiv] Cross-Modal Adapter for Text-Video Retrieval
☆55Updated 2 years ago
aimagelab / PMA-Net
With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning. ICCV 2023
☆18Updated last year
chunmeifeng / SPRC
【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval
☆83Updated last year
cvlab-columbia / DoubleRight
☆27Updated last year
zjuchenlong / WSAG
[EMNLP'22] Weakly-Supervised Temporal Article Grounding
☆14Updated last year
szzexpoi / POEM
Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…
☆10Updated last year
vishaal27 / SuS-X
Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]
☆103Updated last year
UniAdapter / UniAdapter
☆23Updated 2 years ago
TencentARC / FLM
Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)
☆32Updated 2 years ago
yfzhang114 / LLaVA-Align
This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat…
☆78Updated 4 months ago
PVIT-official / PVIT
Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models
☆37Updated last year
leolee99 / PAU
The official implementation of paper "Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval" accepted by NeurIPS…
☆26Updated last year
whwu95 / FreeVA
FreeVA: Offline MLLM as Training-Free Video Assistant
☆60Updated last year
linzhiqiu / visual_gpt_score
VisualGPTScore for visio-linguistic reasoning
☆27Updated last year
wuw2019 / R-AMT
☆20Updated last year
arijitray1993 / COLA
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆24Updated 7 months ago
bbbdylan / proda
[CVPR2022] PyTorch re-implementation of Prompt Distribution Learning
☆18Updated 2 years ago
megvii-research / protoclip
📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)
☆53Updated last year
joeyz0z / MeaCap
(CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning
☆48Updated 11 months ago
GasolSun36 / MVP
Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning
☆22Updated 10 months ago
takomc / amp
【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"
☆19Updated 9 months ago
sail-sg / ptp
[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》
☆152Updated 2 years ago