This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts, and attributes prediction models, query evaluation scripts, and visualization notebooks.
☆293Feb 12, 2024Updated 2 years ago
Alternatives and similar repositories for paco
Users that are interested in paco are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Introduction and scripts for the paper "PartImageNet: A Large, High-Quality Dataset of Parts" (Ju He, Shuo Yang, Shaokang Yang, Adam Kort…☆135Mar 20, 2025Updated last year
- [ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation☆393Sep 19, 2023Updated 2 years ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆141Dec 16, 2025Updated 3 months ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,341Oct 5, 2023Updated 2 years ago
- [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)☆190Mar 22, 2024Updated 2 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆84Nov 2, 2022Updated 3 years ago
- Paper List for In-context Learning 🌷☆19Jan 3, 2023Updated 3 years ago
- GRiT: A Generative Region-to-text Transformer for Object Understanding (ECCV2024)☆341Jan 8, 2024Updated 2 years ago
- Grounded Language-Image Pre-training☆2,585Jan 24, 2024Updated 2 years ago
- [ECCV-2022] The First Unified End-to-End System for Panoptic Part Segmentation☆63Sep 2, 2024Updated last year
- Open-source code for Generic Grouping Network (GGN, CVPR 2022)☆114Mar 2, 2026Updated 3 weeks ago
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Oct 14, 2022Updated 3 years ago
- [ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation☆103May 26, 2023Updated 2 years ago
- [NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects".☆111Jun 13, 2023Updated 2 years ago
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,996Mar 21, 2024Updated 2 years ago
- Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.☆784May 10, 2022Updated 3 years ago
- [ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of …☆506Aug 9, 2024Updated last year
- Detection Transformers with Assignment☆263Sep 16, 2023Updated 2 years ago
- SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral)☆349Aug 2, 2022Updated 3 years ago
- Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupe…☆1,061Jun 4, 2025Updated 9 months ago
- ☆11Jan 18, 2024Updated 2 years ago
- This repository contains code and tools for reading, processing, evaluating on, and visualizing Panoptic Parts datasets. Moreover, it con…☆105May 1, 2022Updated 3 years ago
- ☆164Apr 6, 2023Updated 2 years ago
- Open-vocabulary Object Segmentation with Diffusion Models☆183Aug 15, 2023Updated 2 years ago
- [CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation☆1,701Oct 3, 2024Updated last year
- Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]☆934Jul 6, 2024Updated last year
- [CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation☆155Aug 19, 2023Updated 2 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago
- ☆13Jul 20, 2024Updated last year
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆57Nov 28, 2022Updated 3 years ago
- [ICCV 2023] You Only Look at One Partial Sequence☆343Oct 21, 2023Updated 2 years ago
- This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.☆753Oct 17, 2023Updated 2 years ago
- (ECCVW 2025)GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest☆551Jun 3, 2025Updated 9 months ago
- Official implementation and data release of the paper "Visual Prompting via Image Inpainting".☆317Aug 7, 2023Updated 2 years ago
- Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer(ECCV Oral), and…☆616Feb 21, 2024Updated 2 years ago
- [Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)☆237Aug 3, 2022Updated 3 years ago
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"☆2,813Jul 10, 2025Updated 8 months ago
- Pixel-ImageNet☆45Feb 24, 2022Updated 4 years ago
- Official Repository of ChatCaptioner☆468Apr 13, 2023Updated 2 years ago