amirbar / visual_prompting
Official implementation and data release of the paper "Visual Prompting via Image Inpainting".
☆301 Updated last year
Related projects
Alternatives and complementary repositories for visual_prompting
- [NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?" ☆166 Updated 8 months ago
- Is synthetic data from generative models ready for image recognition? ☆175 Updated last year
- Open-vocabulary Object Segmentation with Diffusion Models ☆172 Updated last year
- Exploring Visual Prompts for Adapting Large-Scale Models ☆265 Updated 2 years ago
- [NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites" ☆258 Updated 9 months ago
- ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning ☆124 Updated last year
- Densely Captioned Images (DCI) dataset repository. ☆158 Updated 4 months ago
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143) ☆153 Updated 11 months ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t… ☆105 Updated 4 months ago
- Open source implementation of "Vision Transformers Need Registers" ☆141 Updated this week
- ☆65 Updated last year
- CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet ☆209 Updated last year
- Research code for paper "Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis" ☆112 Updated this week
- [TPAMI] Searching prompt modules for parameter-efficient transfer learning. ☆222 Updated 11 months ago
- [ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching ☆251 Updated last year
- official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition" ☆164 Updated last month
- ☆91 Updated 5 months ago
- [ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models ☆154 Updated last year
- [CVPR 2023] Zero-shot Generative Model Adaptation via Image-specific Prompt Learning ☆82 Updated last year
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral) ☆104 Updated 7 months ago
- ☆170 Updated last year
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022) ☆92 Updated 2 years ago
- ☆109 Updated 4 months ago
- Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral) ☆403 Updated 2 years ago
- This repo is the official implementation of UPL (Unsupervised Prompt Learning for Vision-Language Models). ☆106 Updated 2 years ago
- PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions. ☆389 Updated 5 months ago
- An official PyTorch implementation of the CRIS paper ☆250 Updated 5 months ago
- Learning from synthetic data - code and models ☆301 Updated 10 months ago
- Augmenting with Language-guided Image Augmentation (ALIA) ☆62 Updated last year
- Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023 ☆134 Updated last year