pumpkin805 / FALIP
[ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
☆13Updated 4 months ago
Alternatives and similar repositories for FALIP:
Users that are interested in FALIP are comparing it to the libraries listed below
- [ICML2024]The official implementation of SemiRES in PyTorch.☆24Updated 7 months ago
- cliptrase☆29Updated 4 months ago
- Video Reasoning Segmentation☆19Updated 2 months ago
- ☆33Updated last week
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding☆40Updated 2 weeks ago
- ☆33Updated last month
- A Large Multimodal Model for Pixel-Level Visual Grounding in Videos☆39Updated last month
- ☆32Updated last year
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆68Updated 6 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆66Updated 3 months ago
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation☆32Updated 3 weeks ago
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models☆39Updated 6 months ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆48Updated 11 months ago
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆31Updated 10 months ago
- OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)☆23Updated 2 months ago
- [ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation☆102Updated 2 weeks ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆77Updated 5 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆46Updated 5 months ago
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation☆72Updated 4 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆74Updated 5 months ago
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)☆24Updated last week
- Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""☆11Updated 6 months ago
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆61Updated 5 months ago
- ☆27Updated 4 months ago
- ☆22Updated 6 months ago
- This is the official code of "Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation, NeurIPS 23"☆25Updated last year
- Official implementation of TagAlign☆34Updated last month
- Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models"☆23Updated 2 weeks ago
- ☆39Updated last year
- OVSegmentor, CVPR23☆57Updated 9 months ago