This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts, and attributes prediction models, query evaluation scripts, and visualization notebooks.
☆291Feb 12, 2024Updated 2 years ago
Alternatives and similar repositories for paco
Users that are interested in paco are comparing it to the libraries listed below
Sorting:
- Introduction and scripts for the paper "PartImageNet: A Large, High-Quality Dataset of Parts" (Ju He, Shuo Yang, Shaokang Yang, Adam Kort…☆135Mar 20, 2025Updated 11 months ago
- [ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation☆393Sep 19, 2023Updated 2 years ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆141Dec 16, 2025Updated 2 months ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆85Nov 2, 2022Updated 3 years ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,343Oct 5, 2023Updated 2 years ago
- [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)☆190Mar 22, 2024Updated last year
- GRiT: A Generative Region-to-text Transformer for Object Understanding (ECCV2024)☆340Jan 8, 2024Updated 2 years ago
- Open-source code for Generic Grouping Network (GGN, CVPR 2022)☆114Jan 28, 2026Updated last month
- Paper List for In-context Learning 🌷☆20Jan 3, 2023Updated 3 years ago
- [ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of …☆505Aug 9, 2024Updated last year
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Oct 14, 2022Updated 3 years ago
- Grounded Language-Image Pre-training☆2,575Jan 24, 2024Updated 2 years ago
- [NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects".☆111Jun 13, 2023Updated 2 years ago
- Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.☆783May 10, 2022Updated 3 years ago
- [ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation☆103May 26, 2023Updated 2 years ago
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,999Mar 21, 2024Updated last year
- Detection Transformers with Assignment☆265Sep 16, 2023Updated 2 years ago
- [ECCV-2022] The First Unified End-to-End System for Panoptic Part Segmentation☆63Sep 2, 2024Updated last year
- Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupe…☆1,059Jun 4, 2025Updated 9 months ago
- [Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)☆237Aug 3, 2022Updated 3 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆57Nov 28, 2022Updated 3 years ago
- This repository contains code and tools for reading, processing, evaluating on, and visualizing Panoptic Parts datasets. Moreover, it con…☆105May 1, 2022Updated 3 years ago
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"☆245Dec 3, 2022Updated 3 years ago
- [ICCV 2023] You Only Look at One Partial Sequence☆343Oct 21, 2023Updated 2 years ago
- Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]☆935Jul 6, 2024Updated last year
- [CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation☆1,703Oct 3, 2024Updated last year
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago
- SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral)☆351Aug 2, 2022Updated 3 years ago
- Official Implementation of DE-CondDETR and DELA-CondDETR in "Towards Data-Efficient Detection Transformers"☆45Aug 25, 2022Updated 3 years ago
- Open-vocabulary Object Segmentation with Diffusion Models☆183Aug 15, 2023Updated 2 years ago
- Pixel-ImageNet☆45Feb 24, 2022Updated 4 years ago
- This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.☆752Oct 17, 2023Updated 2 years ago
- Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer(ECCV Oral), and…☆618Feb 21, 2024Updated 2 years ago
- Official implementation and data release of the paper "Visual Prompting via Image Inpainting".☆318Aug 7, 2023Updated 2 years ago
- ☆13Jul 20, 2024Updated last year
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".☆14Sep 1, 2022Updated 3 years ago
- ☆164Apr 6, 2023Updated 2 years ago
- EVA Series: Visual Representation Fantasies from BAAI☆2,647Aug 1, 2024Updated last year
- ☆278Dec 4, 2024Updated last year