NVlabs/ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

☆945

Alternatives and similar repositories for ODISE

Users who are interested in ODISE are comparing it to the repositories listed below.

facebookresearch / ov-seg
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
☆759 · Oct 17, 2023 · Updated 2 years ago
IDEA-Research / OpenSeeD
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
☆762 · Jan 22, 2024 · Updated 2 years ago
MendelXu / SAN
Open-vocabulary Semantic Segmentation
☆384 · Oct 16, 2024 · Updated last year
microsoft / X-Decoder
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
☆1,346 · Oct 5, 2023 · Updated 2 years ago
bytedance / fc-clip
[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti…
☆345 · Feb 5, 2024 · Updated 2 years ago
chongzhou96 / MaskCLIP
Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral)
☆480 · Sep 19, 2022 · Updated 3 years ago
wl-zhao / VPD
[ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to do…
☆540 · Dec 21, 2023 · Updated 2 years ago
weijiawu / DiffuMask
[ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
☆193 · Nov 1, 2023 · Updated 2 years ago
cvlab-kaist / CAT-Seg
Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
☆385 · Apr 11, 2024 · Updated 2 years ago
baaivision / Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
☆2,593 · Dec 6, 2024 · Updated last year
UX-Decoder / Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
☆4,795 · Aug 19, 2024 · Updated last year
facebookresearch / Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
☆3,415 · Jul 29, 2024 · Updated last year
berkeley-hipie / HIPIE
[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"
☆294 · Jun 19, 2025 · Updated last year
UX-Decoder / Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
☆2,853 · Jul 10, 2025 · Updated last year
NVlabs / GroupViT
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
☆788 · May 10, 2022 · Updated 4 years ago
gligen / GLIGEN
Open-Set Grounded Text-to-Image Generation
☆2,226 · Mar 6, 2024 · Updated 2 years ago
JIA-Lab-research / LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
☆2,662 · Feb 16, 2025 · Updated last year
Lipurple / Grounded-Diffusion
Open-vocabulary Object Segmentation with Diffusion Models
☆184 · Aug 15, 2023 · Updated 2 years ago
facebookresearch / VLPart
[ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation
☆395 · Sep 19, 2023 · Updated 2 years ago
mbzuai-oryx / groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses tha…
☆963 · Aug 5, 2025 · Updated 11 months ago
Junyi42 / sd-dino
Official Implementation of paper "A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence"
☆356 · Mar 29, 2024 · Updated 2 years ago
IDEA-Research / Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and …
☆17,684 · Sep 5, 2024 · Updated last year
jianzongwu / Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
☆998 · May 12, 2026 · Updated 2 months ago
amazon-science / prompt-pretraining
Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"
☆259 · May 3, 2024 · Updated 2 years ago
microsoft / GLIP
Grounded Language-Image Pre-training
☆2,605 · Jan 24, 2024 · Updated 2 years ago
facebookresearch / CutLER
Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupe…
☆1,071 · Apr 14, 2026 · Updated 3 months ago
fudan-zvg / GSS
[CVPR 2023] Official repository of Generative Semantic Segmentation
☆222 · Sep 3, 2023 · Updated 2 years ago
MendelXu / zsseg.baseline
Open-vocabulary Semantic Segmentation
☆185 · Mar 28, 2023 · Updated 3 years ago
Tsingularity / dift
[NeurIPS'23] Emergent Correspondence from Image Diffusion
☆773 · May 14, 2024 · Updated 2 years ago
isl-org / lang-seg
Language-Driven Semantic Segmentation
☆834 · Dec 18, 2024 · Updated last year
ziqihuangg / ReVersion
[SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images
☆503 · Oct 7, 2025 · Updated 9 months ago
cientgu / InstructDiffusion
PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.
☆445 · May 14, 2024 · Updated 2 years ago
SHI-Labs / OneFormer
[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation
☆1,731 · Oct 3, 2024 · Updated last year
dingjiansw101 / ZegFormer
Official code for "Decoupling Zero-Shot Semantic Segmentation"
☆180 · Nov 30, 2022 · Updated 3 years ago
yandex-research / ddpm-segmentation
Label-Efficient Semantic Segmentation with Diffusion Models (ICLR'2022)
☆718 · Apr 8, 2023 · Updated 3 years ago
MichalGeyer / plug-and-play
Official PyTorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)
☆1,001 · Jun 19, 2023 · Updated 3 years ago
ShoufaChen / DiffusionDet
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
☆2,257 · Dec 22, 2022 · Updated 3 years ago
baaivision / EVA
EVA Series: Visual Representation Fantasies from BAAI
☆2,684 · Aug 1, 2024 · Updated last year
facebookresearch / Detic
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
☆2,007 · Mar 21, 2024 · Updated 2 years ago