kingthreestones / RefCLIPLinks
☆35Updated last year
Alternatives and similar repositories for RefCLIP
Users that are interested in RefCLIP are comparing it to the libraries listed below
Sorting:
- A lightweight codebase for referring expression comprehension and segmentation☆55Updated 3 years ago
- ☆21Updated last year
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆68Updated 7 months ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆137Updated 7 months ago
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆23Updated 2 years ago
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆101Updated last year
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆97Updated 2 years ago
- [ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension☆15Updated 2 years ago
- ☆92Updated last year
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021☆67Updated 3 years ago
- cliptrase☆36Updated 9 months ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆52Updated last year
- ☆22Updated last year
- Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)☆25Updated 11 months ago
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆70Updated last year
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆47Updated 10 months ago
- [NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding☆50Updated last year
- [CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆90Updated last week
- Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning☆20Updated last year
- Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022))☆62Updated last year
- [ICCV 2023] Official code release of our paper "Referring Image Segmentation Using Text Supervision"☆69Updated 7 months ago
- ☆37Updated last year
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆29Updated last year
- ☆23Updated 2 years ago
- ☆38Updated 3 years ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆50Updated 5 months ago
- ☆30Updated last year
- ☆12Updated last year
- ICLR‘24 Offical Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization☆72Updated last year
- FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)☆45Updated 9 months ago