PinxueGuo / X-Prompt
☆10Updated 4 months ago
Alternatives and similar repositories for X-Prompt:
Users that are interested in X-Prompt are comparing it to the libraries listed below
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆39Updated last month
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Updated last year
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆14Updated 4 months ago
- Fast and general video object segmentation evaluation.☆29Updated last year
- ☆29Updated 10 months ago
- ☆38Updated 4 months ago
- Video Feature Enhancement with PyTorch☆26Updated 2 months ago
- Awesome video instance segmentation papers☆35Updated this week
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆12Updated 4 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆27Updated 11 months ago
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆11Updated 4 months ago
- The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024☆31Updated 2 months ago
- Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].☆19Updated 3 months ago
- ☆24Updated 8 months ago
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆18Updated 3 weeks ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆13Updated 11 months ago
- EPCFormer: Expression Prompt Collaboration Transformer for Universal Referring Video Object Segmentation☆9Updated last year
- official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation☆39Updated 3 weeks ago
- Code release for "SegLLM: Multi-round Reasoning Segmentation"☆68Updated this week
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆49Updated 3 months ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆33Updated 2 months ago
- Detectron2 Toolbox and Benchmark for V3Det☆16Updated 8 months ago
- ☆25Updated 3 months ago
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆42Updated 3 months ago
- ☆36Updated last month
- All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment☆15Updated last week
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆52Updated last year
- 「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆74Updated 7 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆32Updated 8 months ago