hsb1357173526 / Dynamic_Visual_Prompting
β5Updated last year
Alternatives and similar repositories for Dynamic_Visual_Prompting:
Users that are interested in Dynamic_Visual_Prompting are comparing it to the libraries listed below
- π Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)β52Updated last year
- VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)β35Updated last year
- [ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learningβ38Updated last year
- Towards a Unified View on Visual Parameter-Efficient Transfer Learningβ26Updated 2 years ago
- Source code for EMNLP 2022 paper βPEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Modelsββ48Updated 2 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!β24Updated 4 months ago
- β23Updated 2 years ago
- [ACL 2023] Delving into the Openness of CLIPβ23Updated 2 years ago
- [CVPR2022] PyTorch re-implementation of Prompt Distribution Learningβ18Updated last year
- β56Updated 2 years ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)β32Updated last year
- β26Updated last year
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Modelsβ46Updated last year
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"β33Updated 2 years ago
- [arXiv] Cross-Modal Adapter for Text-Video Retrievalβ55Updated 2 years ago
- [ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"β53Updated last year
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"β13Updated last year
- This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding stratβ¦β77Updated last month
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]β97Updated last year
- β20Updated last year
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)β20Updated 3 years ago
- HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)β44Updated 8 months ago
- This repository contains code for the paper 'Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation'.β15Updated 2 years ago
- β83Updated 2 years ago
- [ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuningβ48Updated 10 months ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoningβ20Updated 6 months ago
- Colorful Prompt Tuning for Pre-trained Vision-Language Modelsβ49Updated 2 years ago
- β29Updated last year
- [EMNLP'22] Weakly-Supervised Temporal Article Groundingβ14Updated last year
- VisualGPTScore for visio-linguistic reasoningβ27Updated last year