zwq456 / CLIP-VIS
Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.
☆34Updated 3 weeks ago
Related projects: ⓘ
- [AAAI 2024] Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆62Updated 2 months ago
- ☆26Updated last week
- [ECCV 2024] Elysium: Exploring Object-level Perception in Videos via MLLM☆47Updated 2 months ago
- Open-vocabulary Semantic Segmentation☆32Updated 7 months ago
- ☆27Updated 5 months ago
- ☆17Updated 5 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆23Updated 6 months ago
- Large-Vocabulary Video Instance Segmentation dataset☆73Updated 2 months ago
- Detectron2 Toolbox and Benchmark for V3Det☆15Updated 3 months ago
- The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024☆20Updated last month
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆43Updated 2 months ago
- ☆58Updated this week
- ☆16Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆67Updated last month
- state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆23Updated 5 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆45Updated 4 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆54Updated 2 months ago
- ☆57Updated last year
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆32Updated last month
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆48Updated 11 months ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆11Updated 6 months ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆24Updated 2 months ago
- DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution☆34Updated 2 months ago
- Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆15Updated 6 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆62Updated 4 months ago
- [ICCV 2023] PyTorch implementation of RandBox☆51Updated 10 months ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆49Updated last month
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆63Updated 3 months ago
- The official implementation of RAR☆61Updated 5 months ago
- ☆20Updated this week