DexiangHong / MANetLinks
☆10Updated 2 years ago
Alternatives and similar repositories for MANet
Users that are interested in MANet are comparing it to the libraries listed below
Sorting:
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆23Updated 2 years ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Updated last year
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆53Updated last year
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆56Updated 2 years ago
- Refer-Youtube-VOS dataset☆25Updated last year
- RefVOS☆29Updated 4 years ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆47Updated 11 months ago
- Referring Video Object Segmentation / Multi-Object Tracking Repo☆87Updated last year
- OvarNet official implement of the paper "OvarNet: Towards Open-vocabulary Object Attribute Recognition"☆104Updated 2 years ago
- (TIP 2024) Towards Robust Referring Image Segmentation☆30Updated last year
- Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]☆25Updated 7 months ago
- ☆36Updated 4 years ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Updated last year
- ☆23Updated 2 years ago
- Multi-Scale Spatio-Temporal Attention based Video Instance Segmentation☆40Updated 2 years ago
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆51Updated last year
- HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model☆46Updated last month
- [NIPS2023] This is an official implementation of paper "DAC-DETR: Divide the Attention Layers and Conquer".☆55Updated last year
- [TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding☆26Updated 9 months ago
- Exploiting unlabeled data with vision and language models for object detection, ECCV 2022☆93Updated last year
- Accepted by CVPR 2022☆36Updated 3 years ago
- (IJCV 2024&ACM MM 2021 Oral) Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation☆19Updated 3 years ago
- [CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detect…☆56Updated 2 months ago
- Official code for ECCV 2022 paper☆31Updated last year
- Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020.☆62Updated 4 years ago
- CVPR 2021 VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild☆30Updated 2 years ago
- A lightweight codebase for referring expression comprehension and segmentation☆55Updated 3 years ago
- This repository is an official implementation of the paper A Simple Baseline for Open-World Tracking via Self-training.☆10Updated last year
- OVSegmentor, CVPR23☆59Updated last year
- ☆40Updated last year