DexiangHong / MANet
☆10Updated 2 years ago
Alternatives and similar repositories for MANet:
Users that are interested in MANet are comparing it to the libraries listed below
- Referring Video Object Segmentation / Multi-Object Tracking Repo☆87Updated last year
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆23Updated 2 years ago
- RefVOS☆29Updated 3 years ago
- Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020.☆60Updated 3 years ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆48Updated 11 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆27Updated 10 months ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆57Updated 2 years ago
- Refer-Youtube-VOS dataset☆25Updated 11 months ago
- OvarNet official implement of the paper "OvarNet: Towards Open-vocabulary Object Attribute Recognition"☆98Updated last year
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Updated last year
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆99Updated 11 months ago
- [ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension☆15Updated 2 years ago
- [CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection☆49Updated last year
- ☆35Updated 2 years ago
- IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆26Updated last month
- Code for the paper "Visual Recognition by Request".☆44Updated 2 years ago
- ☆78Updated 2 years ago
- ☆37Updated 2 years ago
- Multi-Scale Spatio-Temporal Attention based Video Instance Segmentation☆39Updated 2 years ago
- ☆74Updated last year
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆41Updated last year
- OVSegmentor, CVPR23☆57Updated 8 months ago
- (IJCV 2024&ACM MM 2021 Oral) Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation☆19Updated 2 years ago
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆52Updated last year
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆63Updated 7 months ago
- [TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding☆24Updated 4 months ago
- A lightweight codebase for referring expression comprehension and segmentation☆52Updated 2 years ago
- Code for Linguistic Structure Guided Context Modeling for Referring Image Segmentation, ECCV2020.☆14Updated 4 years ago
- [NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding☆45Updated 10 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆46Updated 6 months ago