wl-zhao / VPD

[ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model for downstream visual perception tasks.

☆531

Alternatives and similar repositories for VPD

Users who are interested in VPD are comparing it to the libraries listed below.


Jiawei-Yang / Denoising-ViT
This is the official code release for our work, Denoising Vision Transformers.
☆386 Updated last year
wysoczanska / clip_dinoiser
Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.
☆262 Updated last year
prismformore / Multi-Task-Transformer
Code of ICLR2023 paper "TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding" and ECCV2022 paper "Inverted Py…
☆322 Updated last year
JiYuanFeng / DDP
☆206 Updated last year
chongzhou96 / MaskCLIP
Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral)
☆464 Updated 3 years ago
showlab / DatasetDM
[NeurIPS2023] DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
☆322 Updated 2 years ago
Junyi42 / sd-dino
Official Implementation of paper "A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence"
☆341 Updated last year
weijiawu / DiffuMask
[ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
☆186 Updated 2 years ago
Tsingularity / dift
[NeurIPS'23] Emergent Correspondence from Image Diffusion
☆741 Updated last year
SwinTransformer / MIM-Depth-Estimation
This is an official implementation of our CVPR 2023 paper "Revealing the Dark Secrets of Masked Image Modeling" on Depth Estimation.
☆174 Updated 2 years ago
ma-xu / Context-Cluster
[ICLR 2023 Oral] Image as Set of Points
☆574 Updated last year
cvlab-kaist / CAT-Seg
Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
☆346 Updated last year
cientgu / InstructDiffusion
PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.
☆440 Updated last year
fudan-zvg / meta-prompts
☆74 Updated 9 months ago
NVlabs / ODISE
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
☆930 Updated last year
essunny310 / FreestyleNet
[CVPR 2023 Highlight] Freestyle Layout-to-Image Synthesis
☆153 Updated 2 years ago
fudan-zvg / GSS
[CVPR 2023] Official repository of Generative Semantic Segmentation
☆221 Updated 2 years ago
aliasgharkhani / SLiMe
1-shot image segmentation using Stable Diffusion
☆142 Updated last year
Lipurple / Grounded-Diffusion
Open-vocabulary Object Segmentation with Diffusion Models
☆181 Updated 2 years ago
Haiyang-W / GiT
[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
☆355 Updated 10 months ago
MendelXu / SAN
Open-vocabulary Semantic Segmentation
☆363 Updated last year
amazon-science / c2f-seg
Official Implementation for ICCV'23 paper Coarse-to-Fine Amodal Segmentation with Shape Prior (C2F-Seg).
☆54 Updated last year
u2seg / U2Seg
[CVPR 2024] Code release for "Unsupervised Universal Image Segmentation"
☆226 Updated last year
shinying / dmp
[CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction
☆80 Updated last year
damaggu / TADP
Text-Image Alignment for Diffusion-based Perception (TADP) - CVPR 2024
☆40 Updated last year
diffusion-hyperfeatures / diffusion_hyperfeatures
Official PyTorch Implementation for Diffusion Hyperfeatures, NeurIPS 2023
☆109 Updated last year
CompVis / zigma
A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)
☆339 Updated 8 months ago
SwinTransformer / AiT
☆109 Updated 2 years ago
bytedance / fc-clip
[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti…
☆331 Updated last year
berkeley-hipie / HIPIE
[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"
☆292 Updated 5 months ago