sukjunhwang / VITALinks
VITA: Video Instance Segmentation via Object Token Association (NeurIPS 2022)
☆102Updated last year
Alternatives and similar repositories for VITA
Users that are interested in VITA are comparing it to the libraries listed below
Sorting:
- [CVPR'23] A Generalized Framework for Video Instance Segmentation☆130Updated last year
- [ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation☆78Updated last year
- ☆78Updated last year
- Multi-Scale Spatio-Temporal Attention based Video Instance Segmentation☆40Updated 2 years ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Updated last year
- Referring Video Object Segmentation / Multi-Object Tracking Repo☆88Updated 2 years ago
- [ICCV-2023]-Universal Video Segmentaion For VSS, VPS and VIS☆110Updated last year
- Video Instance Segmentation using Inter-Frame Communication Transformers (NeurIPS 2021)☆93Updated last year
- Code for the VOST dataset☆26Updated last year
- Large-Vocabulary Video Instance Segmentation dataset☆90Updated last year
- [ECCV2022] Global Spectral Filter Memory Network for Video Object Segmentation☆41Updated 3 years ago
- ☆37Updated 2 years ago
- Accepted by CVPR 2022☆36Updated 3 years ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆47Updated last year
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆51Updated last year
- CVPR2022: Large-scale Video Panoptic Segmentation in the Wild: A Benchmark☆144Updated 2 years ago
- Associating Objects with Transformers for Video Object Segmentation☆141Updated last year
- Tracking with Human-Intent Reasoning☆72Updated 9 months ago
- [NIPS2023] This is an official implementation of paper "DAC-DETR: Divide the Attention Layers and Conquer".☆55Updated last year
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Updated last year
- Recognize Any Regions☆122Updated 7 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆190Updated last year
- Fast and general video object segmentation evaluation.☆33Updated last year
- COCO API Customized for OVIS evaluation☆14Updated 3 years ago
- [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)☆189Updated last year
- PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"☆93Updated 2 years ago
- [CVPR 2023] Official code for "Zero-shot Referring Image Segmentation with Global-Local Context Features"☆126Updated 4 months ago
- ☆124Updated last year
- [CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation☆153Updated last year
- [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection☆182Updated last year