PinxueGuo / X-Prompt
☆13Updated 6 months ago
Alternatives and similar repositories for X-Prompt:
Users that are interested in X-Prompt are comparing it to the libraries listed below
- Awesome video instance segmentation papers☆39Updated this week
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆40Updated 3 months ago
- Fast and general video object segmentation evaluation.☆31Updated last year
- A list of referring video object segmentation papers☆32Updated last week
- ☆41Updated 6 months ago
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆18Updated last month
- ECCV 2024 STMA & CVPR 2024 1st MOSE & 1st VOT Challenge & 1st LSVOS v6☆11Updated 6 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆70Updated 6 months ago
- ☆25Updated 10 months ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆15Updated 9 months ago
- This repository is for the first survey on SAM & SAM2 for Videos.☆42Updated last week
- Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].☆25Updated 5 months ago
- All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment☆16Updated 2 months ago
- official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation☆43Updated 2 months ago
- [CVPR 2024 Challenge] 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation☆31Updated 6 months ago
- Learning Better Video Query with SAM for Video Instance Segmentation (TCSVT 2024)☆22Updated last year
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆29Updated last year
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆14Updated 6 months ago
- Transactions on Multimedia (TMM25)☆12Updated last week
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆14Updated 6 months ago
- ☆20Updated 8 months ago
- Code & Weights for “Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation”☆12Updated 4 months ago
- 「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆78Updated 9 months ago
- EPCFormer: Expression Prompt Collaboration Transformer for Universal Referring Video Object Segmentation☆9Updated last year
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Updated 2 years ago
- [CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆83Updated 8 months ago
- ☆11Updated 3 weeks ago
- ☆16Updated 6 months ago
- Code release for "Weakly Supervised Open-Vocabulary Object Detection", AAAI2024☆34Updated 7 months ago
- ☆22Updated last year