sukjunhwang / VITALinks
VITA: Video Instance Segmentation via Object Token Association (NeurIPS 2022)
☆104Updated last year
Alternatives and similar repositories for VITA
Users that are interested in VITA are comparing it to the libraries listed below
Sorting:
- [CVPR'23] A Generalized Framework for Video Instance Segmentation☆131Updated last year
- [ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation☆77Updated last year
- Video Instance Segmentation using Inter-Frame Communication Transformers (NeurIPS 2021)☆93Updated last year
- [ICCV-2023]-Universal Video Segmentaion For VSS, VPS and VIS☆110Updated last year
- Multi-Scale Spatio-Temporal Attention based Video Instance Segmentation☆40Updated 3 years ago
- ☆78Updated 2 years ago
- [ECCV2022] Global Spectral Filter Memory Network for Video Object Segmentation☆41Updated 3 years ago
- ☆37Updated 2 years ago
- Referring Video Object Segmentation / Multi-Object Tracking Repo☆88Updated 2 years ago
- Large-Vocabulary Video Instance Segmentation dataset☆94Updated last year
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Updated last year
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Updated last year
- Associating Objects with Transformers for Video Object Segmentation☆142Updated last year
- CVPR2022: Large-scale Video Panoptic Segmentation in the Wild: A Benchmark☆144Updated 2 years ago
- Code for the VOST dataset☆26Updated 2 years ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆47Updated last year
- Accepted by CVPR 2022☆36Updated 3 years ago
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆54Updated last year
- [NIPS2023] This is an official implementation of paper "DAC-DETR: Divide the Attention Layers and Conquer".☆58Updated last year
- RefVOS☆29Updated 4 years ago
- OVSegmentor, CVPR23☆59Updated last year
- ☆51Updated 2 years ago
- [CVPR 2023] This is the official PyTorch implementation for "Dynamic Focus-aware Positional Queries for Semantic Segmentation".☆60Updated 2 years ago
- [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)☆189Updated last year
- Self-Supervised Video Representation Learning with Motion-Aware Masked Autoencoders☆23Updated last year
- Tracking with Human-Intent Reasoning☆72Updated 11 months ago
- ☆48Updated 2 years ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆144Updated 11 months ago
- COCO API Customized for OVIS evaluation☆15Updated 3 years ago
- [CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation☆156Updated 2 years ago