LinfengYuan1997 / LoShView external linksLinks
[CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
☆13Jun 17, 2024Updated last year
Alternatives and similar repositories for LoSh
Users that are interested in LoSh are comparing it to the libraries listed below
Sorting:
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆19Jul 20, 2024Updated last year
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆20Sep 5, 2025Updated 5 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Mar 13, 2024Updated last year
- [CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation☆29Apr 28, 2025Updated 9 months ago
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation☆33Mar 16, 2024Updated last year
- [CVPR 2024 Challenge] 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation☆32Oct 18, 2024Updated last year
- ☆18Nov 15, 2024Updated last year
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆19Apr 9, 2025Updated 10 months ago
- Referring Image Segmentation Benchmarking with Segment Anything Model (SAM)☆38Apr 7, 2023Updated 2 years ago
- [NeurIPS 2024 Oral] RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation☆19Dec 22, 2024Updated last year
- [CVPR 2024] Dual Prototype Attention for Unsupervised Video Object Segmentation☆39Apr 21, 2024Updated last year
- Awesome video instance segmentation papers☆51Dec 17, 2025Updated last month
- Related papers about Referring Image Segmentation (RIS)☆16Dec 26, 2023Updated 2 years ago
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆49Sep 24, 2024Updated last year
- Vision Relation Transformer for Unbiased Scene Graph Generation (ICCV 2023)☆22Sep 27, 2023Updated 2 years ago
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024☆47Sep 28, 2024Updated last year
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆47Jan 7, 2025Updated last year
- This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding"☆26Aug 24, 2023Updated 2 years ago
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…☆66Jun 23, 2025Updated 7 months ago
- [AAAI 2025] AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video…☆91Dec 23, 2024Updated last year
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆57Oct 7, 2023Updated 2 years ago
- [CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models☆18Jul 22, 2024Updated last year
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆24Aug 12, 2022Updated 3 years ago
- [AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆25Dec 30, 2024Updated last year
- Learning Better Video Query with SAM for Video Instance Segmentation (TCSVT 2024)☆26Apr 2, 2024Updated last year
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆60Feb 28, 2025Updated 11 months ago
- [CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation☆86Jul 24, 2024Updated last year
- [NeurIPS‘24] Multi-Object 3D Grounding with Dynamic Modules and Language Informed Spatial Attention☆27Jun 15, 2025Updated 7 months ago
- [ICCV2023] Isomer: Isomerous Transformer for Zero-Shot Video Object Segmentation☆30Nov 21, 2023Updated 2 years ago
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆59Nov 10, 2025Updated 3 months ago
- Segment Anything with Deictic Prompting☆27May 13, 2025Updated 9 months ago
- Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning☆41Aug 4, 2025Updated 6 months ago
- Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]☆28Nov 28, 2024Updated last year
- [ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.☆111Apr 9, 2025Updated 10 months ago
- [CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation☆64Dec 23, 2024Updated last year
- An unofficial implementation for paper "DenseCLIP: Extract Free Dense Labels from CLIP"☆23Jan 27, 2022Updated 4 years ago
- ☆32Mar 25, 2024Updated last year
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆12Oct 14, 2024Updated last year
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"☆37Oct 11, 2024Updated last year