heshuting555 / DsHmpLinks
[CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
☆85Updated last year
Alternatives and similar repositories for DsHmp
Users that are interested in DsHmp are comparing it to the libraries listed below
Sorting:
- Multimodal Referring Segmentation☆190Updated 2 weeks ago
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation☆84Updated 3 months ago
- A benchmark dataset for GRES and GREC [CVPR2023 Highlight]☆240Updated 3 weeks ago
- [CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation☆18Updated 2 years ago
- [NeurIPS 2025] Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale perso…☆71Updated last month
- [CVPR-2023] Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation☆190Updated 2 years ago
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation☆66Updated last year
- [ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes☆361Updated 2 months ago
- ☆184Updated this week
- [TIP-2023] Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation☆82Updated 2 years ago
- [ICCV 2025] Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation☆81Updated 2 months ago
- [ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation☆360Updated 3 years ago
- [ICCV 2025] Free-Form Motion Control: Controlling the 6D Poses of Camera and Objects in Video Generation☆53Updated 3 months ago
- [ICCV 2025] AnyI2V: Animating Any Conditional Image with Motion Control Generation☆120Updated 3 months ago
- [ICCV 2023 & TPAMI 2025] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions☆520Updated this week
- [CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation☆690Updated 2 weeks ago
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation☆33Updated last year
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆19Updated last year
- [AAAI 2026] Segment Anything Across Shots: A Method and Benchmark☆23Updated 3 weeks ago
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024☆45Updated last year
- Large-Vocabulary Video Instance Segmentation dataset☆95Updated last year
- [AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆24Updated 11 months ago
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…☆64Updated 5 months ago
- Code of BRIDGE: Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation☆114Updated 2 months ago
- ☆59Updated last year
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆72Updated last year
- [CVPR2023 Highlight] Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object Detection☆312Updated 2 years ago
- This is the official implementation of "GvSeg: General and Task-Oriented Video Segmentation" (Accepted at ECCV 2024).☆18Updated last year
- [AAAI 2025] AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video…☆91Updated 11 months ago
- [ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation☆138Updated 5 months ago