jochemloedeman / PGN
Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers. Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M. Asano. Tech Report. 2022
☆41Updated 7 months ago
Alternatives and similar repositories for PGN:
Users that are interested in PGN are comparing it to the libraries listed below
- ☆56Updated 2 years ago
- Compress conventional Vision-Language Pre-training data☆49Updated last year
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Updated last year
- [TMLR'24] This repository includes the official implementation our paper "Unleashing the Power of Visual Prompting At the Pixel Level"☆39Updated 11 months ago
- ☆61Updated last year
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models☆46Updated last year
- ☆16Updated 3 months ago
- Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22☆41Updated 2 years ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆50Updated 3 months ago
- ☆59Updated 3 years ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆53Updated last year
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Updated 2 years ago
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)☆13Updated last year
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆20Updated 4 months ago
- ☆64Updated last year
- ☆52Updated 2 years ago
- Official code for "Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models" (TCSVT'2023)☆27Updated last year
- [CVPR 2023] Improving Zero-shot Generalization and Robustness of Multi-modal Models☆32Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆38Updated last year
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)☆35Updated 2 years ago
- Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning☆20Updated last year
- VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition☆68Updated 2 years ago
- ☆29Updated last year
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆36Updated last year
- [ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learning☆38Updated last year
- Create generated datasets and train robust classifiers☆34Updated last year
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…☆82Updated 2 years ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆41Updated last year
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning☆39Updated last year
- On-Device Domain Generalization☆42Updated 2 years ago