Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers. Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M. Asano. Tech Report. 2022
☆44Sep 11, 2024Updated last year
Alternatives and similar repositories for PGN
Users that are interested in PGN are comparing it to the libraries listed below
Sorting:
- code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720☆56Jun 5, 2024Updated last year
- Exploring Visual Prompts for Adapting Large-Scale Models☆289Jun 6, 2022Updated 3 years ago
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"☆35Dec 5, 2022Updated 3 years ago
- ImaginaryNet: Learning Object Detectors without Real Images and Annotations☆26Mar 11, 2023Updated 2 years ago
- ☆19Jan 2, 2023Updated 3 years ago
- Source code for "MEDIMP: 3D Medical Images with clinical Prompts from limited tabular data for renal transplantation", MIDL 2023, https:/…☆10Apr 29, 2023Updated 2 years ago
- [ICCV 2023] Prompt-aligned Gradient for Prompt Tuning☆168Jul 15, 2023Updated 2 years ago
- ☆13Apr 30, 2022Updated 3 years ago
- ECCV2022,Bootstrapped Masked Autoencoders for Vision BERT Pretraining☆97Nov 2, 2022Updated 3 years ago
- LAEO-Net++☆21Mar 24, 2021Updated 4 years ago
- ❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119☆1,214Sep 2, 2023Updated 2 years ago
- This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…☆43Feb 21, 2023Updated 3 years ago
- [TIP] Exploring Effective Factors for Improving Visual In-Context Learning☆20Jul 2, 2025Updated 8 months ago
- Benchmark for single-view 3D reconstructions of articulated animals. 3DV 2024☆22Jul 1, 2024Updated last year
- Multimodal Neurons in Artificial Neural Networks☆16Oct 18, 2021Updated 4 years ago
- Controllable mage captioning model with unsupervised modes☆21Apr 14, 2023Updated 2 years ago
- Code accompanying "Adaptive Methods for Aggregated Domain Generalization"☆18Dec 11, 2021Updated 4 years ago
- [CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.☆119Oct 9, 2023Updated 2 years ago
- ☆200May 10, 2023Updated 2 years ago
- LLMBind: A Unified Modality-Task Integration Framework☆19Jun 16, 2024Updated last year
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass☆198Aug 1, 2023Updated 2 years ago
- Python code for ICLR 2022 spotlight paper EViT: Expediting Vision Transformers via Token Reorganizations☆199Sep 3, 2023Updated 2 years ago
- ☆27Mar 3, 2025Updated last year
- vit for few-shot classification☆47Mar 24, 2023Updated 2 years ago
- Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation, ECCV 2024☆22Feb 15, 2024Updated 2 years ago
- ☆28Apr 8, 2025Updated 11 months ago
- Colorful Prompt Tuning for Pre-trained Vision-Language Models☆49Nov 1, 2022Updated 3 years ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆32Mar 26, 2025Updated 11 months ago
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023☆55Aug 19, 2023Updated 2 years ago
- [NeurIPS 2024] Repository for the paper "OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking".☆27Nov 9, 2024Updated last year
- [NeurIPS2024] Overcome hallucination of diffusion restoration models.☆65Apr 14, 2025Updated 10 months ago
- [ACCV 2024] Official Implementation of "AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description". Junyu Xie, Tengda Han, M…☆29Jan 28, 2025Updated last year
- A curated list of prompt-based paper in computer vision and vision-language learning.☆925Dec 18, 2023Updated 2 years ago
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆174Dec 14, 2023Updated 2 years ago
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Oct 13, 2022Updated 3 years ago
- This repo contains code for Invariant Grounding for Video Question Answering☆27Mar 2, 2023Updated 3 years ago
- [ECCV 2024] Soft Prompt Generation for Domain Generalization☆31Oct 1, 2024Updated last year
- [ICCV2023] The repo for "Boosting Multi-modal Model Performance with Adaptive Gradient Modulation".☆28Jan 26, 2024Updated 2 years ago
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆33Jul 21, 2023Updated 2 years ago