[CVPR 2025] Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation
☆31 · Jun 27, 2025 · Updated 8 months ago
Alternatives and similar repositories for HybridGL
Users who are interested in HybridGL are comparing it to the repositories listed below
- The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima… ☆20 · Apr 6, 2025 · Updated 11 months ago
- ☆14 · Jul 8, 2024 · Updated last year
- [AAAI-2025] The official code of Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation ☆67 · May 21, 2025 · Updated 9 months ago
- Code release for "Segment, Select, Correct: A Framework for Weakly-Supervised Referring Segmentation" ☆14 · Oct 23, 2023 · Updated 2 years ago
- [CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation ☆23 · Nov 17, 2025 · Updated 3 months ago
- (TIP 2024) Towards Robust Referring Image Segmentation ☆36 · Mar 2, 2024 · Updated 2 years ago
- [CVPR 2023] Official code for "Zero-shot Referring Image Segmentation with Global-Local Context Features" ☆129 · Mar 17, 2025 · Updated 11 months ago
- [CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing ☆40 · Jan 12, 2026 · Updated last month
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository. ☆45 · Jul 11, 2024 · Updated last year
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding ☆23 · Feb 26, 2025 · Updated last year
- ☆23 · Aug 20, 2024 · Updated last year
- ☆28 · Jul 22, 2024 · Updated last year
- This repo is the official pytorch implementation of the paper: CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-V… ☆40 · Sep 10, 2025 · Updated 5 months ago
- [ICLR 2024] The official implementation of Zip-Your-Clip ☆35 · Mar 14, 2024 · Updated last year
- ☆31 · Jun 14, 2024 · Updated last year
- Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral) ☆34 · Sep 17, 2022 · Updated 3 years ago
- ☆22 · May 18, 2025 · Updated 9 months ago
- Similar to the 2D Base Model, 3D Base Model is a bridge between images and 3D data. ☆25 · Updated this week
- ☆13 · Jul 3, 2024 · Updated last year
- [ACM MM2024] The code for HMLLM. ☆11 · Oct 27, 2024 · Updated last year
- A program for simple mouse control via hand gestures using MediaPipe. ☆12 · Mar 17, 2021 · Updated 4 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning ☆18 · May 8, 2025 · Updated 10 months ago
- Official implementation of "ImagineFSL: Self-Supervised Pretraining Matters on Imagined Base Set for VLM-based Few-shot Learning" [CVPR 2… ☆25 · Sep 1, 2025 · Updated 6 months ago
- ☆88 · Dec 3, 2025 · Updated 3 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference ☆97 · Mar 26, 2025 · Updated 11 months ago
- [ECCV 2024] FlexAttention for Efficient High-Resolution Vision-Language Models ☆46 · Jan 8, 2025 · Updated last year
- accepted by MICCAI2024 ☆44 · Nov 28, 2024 · Updated last year
- The source code for the paper: Yirong Mao, Ruiping Wang, Shiguang Shan, Xilin Chen. COSONet: Compact Second-Order Network for Video Face … ☆12 · Dec 27, 2018 · Updated 7 years ago
- Hypergraph Vision Transformers: Images are More than Nodes, More than Edges ☆17 · Jul 25, 2025 · Updated 7 months ago
- ☆16 · Feb 23, 2025 · Updated last year
- ☆16 · Dec 25, 2025 · Updated 2 months ago
- ☆13 · Sep 8, 2024 · Updated last year
- ☆14 · Sep 11, 2025 · Updated 5 months ago
- ☆13 · Apr 10, 2025 · Updated 10 months ago
- An Android WebView with full screen video ☆10 · Aug 17, 2017 · Updated 8 years ago
- Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024) ☆14 · Jun 21, 2024 · Updated last year
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models" ☆13 · Aug 6, 2024 · Updated last year
- ☆20 · Nov 21, 2025 · Updated 3 months ago
- ☆13 · May 15, 2025 · Updated 9 months ago