alipay / POALinks
☆21Updated 11 months ago
Alternatives and similar repositories for POA
Users that are interested in POA are comparing it to the libraries listed below
Sorting:
- [CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing☆35Updated last year
- ☆16Updated 11 months ago
- ☆10Updated 4 months ago
- [CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models☆17Updated 11 months ago
- ☆17Updated last year
- A Large Multimodal Model for Remote Sensing Change Description (IGARSS 2025)☆19Updated 8 months ago
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation☆47Updated last month
- Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".☆41Updated 10 months ago
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆24Updated 9 months ago
- Implementation of the paper "PerSense: Personalized Instance Segmentation in Dense Images"☆25Updated 4 months ago
- ☆17Updated 8 months ago
- [ICLR 2025] Official Pytorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segm…☆15Updated 3 months ago
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.☆39Updated last year
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆40Updated 5 months ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆26Updated 2 months ago
- ☆32Updated last year
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆56Updated 8 months ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆51Updated 2 months ago
- ☆31Updated 10 months ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆44Updated 2 years ago
- [CVPR 2025] Few-shot Recognition via Stage-Wise Retrieval-Augmented Finetuning☆20Updated 3 weeks ago
- Official Implementation of DiffCLIP: Differential Attention Meets CLIP☆36Updated 4 months ago
- [CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation☆60Updated 5 months ago
- Open-vocabulary Semantic Segmentation☆33Updated last year
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆35Updated last year
- ☆14Updated last year
- [TIP] Exploring Effective Factors for Improving Visual In-Context Learning☆19Updated 2 weeks ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- [CVPR 2024] The official pytorch implementation of "A General and Efficient Training for Transformer via Token Expansion".☆44Updated last year