CongpeiQiu / CLIPRefiner
[ICLR2025] Code Release of Refining CLIP's Spatial Awareness: A Visual-centric Perspective
☆20Updated 9 months ago
Alternatives and similar repositories for CLIPRefiner
Users who are interested in CLIPRefiner are comparing it to the repositories listed below.
- ☆16Updated last month
- ☆20Updated last week
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆749Updated 2 months ago
- paper list on Video Moment Retrieval (VMR), or Temporal Video Grounding (TVG), Video Grounding (VG), or Temporal Sentence Grounding in Vi…☆34Updated last month
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆204Updated last year
- [CVPR 2025 Highlight] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆59Updated 5 months ago
- A curated publication list on open vocabulary semantic segmentation and related areas (e.g. zero-shot semantic segmentation).☆827Updated 3 weeks ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆348Updated last month
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"☆597Updated 3 weeks ago
- Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future☆215Updated 10 months ago
- ☆16Updated last year
- Official repo for ICCV25 Highlight Paper: "ObjectRelator: Enabling Cross-View Object Relation Understanding in Ego-Centric and Exo-Centric…☆54Updated 4 months ago
- ☆48Updated last year
- The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…☆20Updated 10 months ago
- [TPAMI 2025] Towards Visual Grounding: A Survey☆294Updated 2 months ago
- This is the official repo of MLLM-CL.☆61Updated 4 months ago
- [CVPR2025] Number it: Temporal Grounding Videos like Flipping Manga☆144Updated 3 weeks ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆123Updated 3 months ago
- [AAAI 2025] AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video…☆91Updated last year
- [NeurIPS 2025] Deep Memory Backtracking for Long Video Understanding☆64Updated 3 months ago
- ☆10Updated last year
- [CVPR 2025] PyTorch implementation of T-CORE, introduced in "When the Future Becomes the Past: Taming Temporal Correspondence for Self-su…☆18Updated 3 months ago
- Official repository for VisionZip (CVPR 2025)☆403Updated 6 months ago
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆348Updated last month
- (TPAMI 2024) A Survey on Open Vocabulary Learning☆986Updated last month
- [ICLR 2025 Spotlight] This is the official repository for our paper: ''Enhancing Pre-trained Representation Classifiability can Boost its…☆25Updated 9 months ago
- ☆18Updated 9 months ago
- [NeurIPS 2025] The official PyTorch implementation of the "Vision Function Layer in MLLM".☆27Updated last month
- Code for the paper "CoReS: Orchestrating the Dance of Reasoning and Segmentation"☆21Updated 2 months ago
- [TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198☆290Updated this week