dzh19990407 / LBDT
CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
☆23 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for LBDT
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation ☆48 · Updated 9 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023] ☆25 · Updated 7 months ago
- ☆33 · Updated last year
- Referring Video Object Segmentation / Multi-Object Tracking Repo ☆87 · Updated last year
- [AAAI 2024] Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation ☆68 · Updated 4 months ago
- (TIP 2024) Towards Robust Referring Image Segmentation ☆22 · Updated 8 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation ☆45 · Updated 3 months ago
- RefVOS ☆28 · Updated 3 years ago
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation ☆52 · Updated last year
- SeqTR: A Simple yet Universal Network for Visual Grounding ☆130 · Updated last week
- Official PyTorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023] ☆47 · Updated 11 months ago
- The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024 ☆25 · Updated 3 weeks ago
- ☆32 · Updated 7 months ago
- Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020. ☆59 · Updated 3 years ago
- ☆47 · Updated last year
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" NeurIPS 2021 ☆65 · Updated 2 years ago
- ☆21 · Updated 2 years ago
- [NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding ☆45 · Updated 8 months ago
- [CVPR 2023] This is the official PyTorch implementation for "Dynamic Focus-aware Positional Queries for Semantic Segmentation". ☆56 · Updated last year
- Multi-Scale Spatio-Temporal Attention based Video Instance Segmentation ☆39 · Updated 2 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding ☆56 · Updated last year
- Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22 ☆42 · Updated 2 years ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model' ☆31 · Updated last year
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio… ☆23 · Updated last year
- Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks ☆21 · Updated 2 years ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning ☆55 · Updated 2 weeks ago
- [ECCV2022] Global Spectral Filter Memory Network for Video Object Segmentation ☆37 · Updated 2 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022) ☆84 · Updated 2 years ago
- This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation" ☆19 · Updated last year