wjf5203 / VNext
Next-generation video instance recognition framework built on top of Detectron2, which supports InstMove (CVPR 2023), SeqFormer (ECCV Oral), and IDOL (ECCV Oral)
☆609 Updated 11 months ago
Alternatives and similar repositories for VNext:
Users who are interested in VNext compare it to the repositories listed below.
- SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral) ☆345 Updated 2 years ago
- ☆267 Updated 2 months ago
- Mask Transfiner for High-Quality Instance Segmentation, CVPR 2022 ☆540 Updated 2 years ago
- [CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segme… ☆1,256 Updated last year
- [ECCV'22 Oral] Towards Grand Unification of Object Tracking ☆950 Updated 2 years ago
- Mask-Free Video Instance Segmentation [CVPR 2023] ☆361 Updated 10 months ago
- Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral ☆239 Updated last year
- [NeurIPS2021] Code Release of K-Net: Towards Unified Image Segmentation ☆471 Updated 3 years ago
- FreeSOLO for unsupervised instance segmentation, CVPR 2022 ☆314 Updated 2 years ago
- [CVPR2022] Official Implementation of ReferFormer ☆336 Updated this week
- [ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection" ☆685 Updated last year
- [ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR" ☆533 Updated last year
- Global Tracking Transformers, CVPR 2022 ☆379 Updated 2 years ago
- [NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation ☆550 Updated 11 months ago
- [NeurIPS'21] Unified tracking framework with a single appearance model. It supports Single Object Tracking (SOT), Video Object Segmentati… ☆344 Updated 2 years ago
- [ICML 2023] Official PyTorch implementation of Global Context Vision Transformers ☆429 Updated last year
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language ☆1,303 Updated last year
- Prototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation, NeurIPS 2021 Spotlight ☆363 Updated 2 years ago
- An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch ☆566 Updated 11 months ago
- [CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion. Semi-supervised … ☆471 Updated 3 months ago
- [ICCV 2023] You Only Look at One Partial Sequence ☆340 Updated last year
- Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [CVPR 2021] ☆542 Updated last year
- [ECCV'22] Official repository of paper titled "Class-agnostic Object Detection with Multi-modal Transformer". ☆308 Updated last year
- [NeurIPS 2022] Official code for "Focal Modulation Networks" ☆716 Updated last year
- Object detection on multiple datasets with an automatically learned unified label space. ☆501 Updated 11 months ago
- BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training ☆396 Updated 3 months ago
- [CVPR 2022 Oral] Official implementation of DN-DETR ☆555 Updated last year
- Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight) ☆1,379 Updated 2 years ago
- [CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers ☆744 Updated 3 years ago
- This repository is an official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence". (https://arxiv.org… ☆369 Updated last year