amirbar / visual_prompting
Official implementation and data release of the paper "Visual Prompting via Image Inpainting".
☆310Updated last year
Alternatives and similar repositories for visual_prompting:
Users that are interested in visual_prompting are comparing it to the libraries listed below
- [NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"☆173Updated last year
- Open-vocabulary Object Segmentation with Diffusion Models☆178Updated last year
- Is synthetic data from generative models ready for image recognition?☆182Updated 2 years ago
- Exploring Visual Prompts for Adapting Large-Scale Models☆277Updated 2 years ago
- CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet☆214Updated 2 years ago
- Learning from synthetic data - code and models☆314Updated last year
- Research code for paper "Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis"☆112Updated 5 months ago
- Densely Captioned Images (DCI) dataset repository.☆177Updated 9 months ago
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)☆164Updated last year
- PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.☆425Updated 11 months ago
- Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral)☆439Updated 2 years ago
- [ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching☆253Updated last year
- ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning☆131Updated 2 years ago
- ☆107Updated 2 months ago
- An official PyTorch implementation of the CRIS paper☆271Updated 10 months ago
- [NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"☆276Updated last year
- CLIPScore EMNLP code☆221Updated 2 years ago
- Open-vocabulary Semantic Segmentation☆171Updated 2 years ago
- [ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models☆174Updated last year
- Introduction and scripts for the paper "PartImageNet: A Large, High-Quality Dataset of Parts" (Ju He, Shuo Yang, Shaokang Yang, Adam Kort…☆124Updated last month
- Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"☆258Updated 11 months ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆314Updated 10 months ago
- ☆186Updated last year
- Augmenting with Language-guided Image Augmentation (ALIA)