Ali2500 / BURST-benchmark
☆71 Updated last year
Related projects:
- ICCV'2023 | CTVIS: Consistent Training for Online Video Instance Segmentation ☆70 Updated 11 months ago
- [ICCV-2023]-Universal Video Segmentation For VSS, VPS and VIS ☆109 Updated 6 months ago
- Video Instance Segmentation using Inter-Frame Communication Transformers (NeurIPS 2021) ☆90 Updated 8 months ago
- Large-Vocabulary Video Instance Segmentation dataset ☆73 Updated 2 months ago
- VITA: Video Instance Segmentation via Object Token Association (NeurIPS 2022) ☆102 Updated 8 months ago
- ☆87 Updated 2 months ago
- [CVPR'23] A Generalized Framework for Video Instance Segmentation ☆125 Updated 8 months ago
- [CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation ☆149 Updated last year
- CVPR2022: Large-scale Video Panoptic Segmentation in the Wild: A Benchmark ☆133 Updated last year
- [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843) ☆179 Updated 5 months ago
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection ☆104 Updated 4 months ago
- Recognize Any Regions ☆115 Updated 9 months ago
- [Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022) ☆203 Updated 2 years ago
- ☆47 Updated last year
- Associating Objects with Transformers for Video Object Segmentation ☆128 Updated 5 months ago
- ☆93 Updated 3 months ago
- CVPR 2021 VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild ☆29 Updated last year
- COCO API Customized for OVIS evaluation ☆13 Updated 2 years ago
- Rethinking Self-Supervised Correspondence Learning: A Video Frame-level Similarity Perspective, in ICCV 2021 (Oral) ☆144 Updated 2 years ago
- Tracking with Human-Intent Reasoning ☆63 Updated 8 months ago
- Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting. ☆52 Updated 2 months ago
- Multi-Scale Spatio-Temporal Attention based Video Instance Segmentation ☆39 Updated 2 years ago
- Code for our paper "Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers" ☆33 Updated 11 months ago
- Code for the paper "Visual Recognition by Request". ☆44 Updated last year
- [ICCV'23] Cascade-DETR: Delving into High-Quality Universal Object Detection ☆93 Updated last year
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022) ☆84 Updated last year
- [ECCV-2022] The First Unified End-to-End System for Panoptic Part Segmentation ☆53 Updated 2 weeks ago
- OVSegmentor, CVPR23 ☆53 Updated 4 months ago
- DVIS: Decoupled Video Instance Segmentation Framework ☆124 Updated 5 months ago
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model ☆88 Updated 2 months ago