[ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
☆17Sep 11, 2024Updated last year
Alternatives and similar repositories for FALIP
Users that are interested in FALIP are comparing it to the libraries listed below
Sorting:
- This is the official repository for paper: cross-modal information flow in multimodal large language models☆42May 21, 2025Updated 10 months ago
- ☆12Dec 17, 2024Updated last year
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆44Jul 2, 2025Updated 8 months ago
- The official pytorch implemention of our IJCV-2025 paper "Learning with Enriched Inductive Biases for Vision-Language Models".☆14Mar 26, 2025Updated 11 months ago
- ☆23Aug 20, 2024Updated last year
- ☆18Jun 16, 2025Updated 9 months ago
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆101Oct 29, 2025Updated 4 months ago
- ☆12Sep 6, 2023Updated 2 years ago
- Enhancing Unpaired Multi-Modal Medical Image Segmentation with Modality-Conditioned Text Embedding and Alternating Training☆24Jan 2, 2025Updated last year
- ☆14Nov 7, 2024Updated last year
- MICCAI 2024: Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images☆27Apr 3, 2025Updated 11 months ago
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆21Sep 5, 2025Updated 6 months ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆54Feb 1, 2024Updated 2 years ago
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆15Jul 31, 2025Updated 7 months ago
- 【ICME2025 Oral】Offical Pytorch Code for "Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition"☆11Mar 21, 2025Updated last year
- code for FineLIP☆40Nov 25, 2025Updated 3 months ago
- Project page of the paper 'Deep Learning for Handling Kernel/model Uncertainty in Image Deconvolution' (CVPR 2020)☆17Aug 2, 2020Updated 5 years ago
- [ICCV 2023] Bayesian Prompt Learning for Image-Language Model Generalization☆40Oct 6, 2023Updated 2 years ago
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- Official repository for "Self-Distilled Vision Transformer for Domain Generalization" (ACCV-2022 ORAL)☆42Dec 2, 2022Updated 3 years ago
- Project page of the paper 'Variational-EM-based Deep Learning for Noise-blind Image Deblurring' (CVPR 2020)☆14Jan 27, 2022Updated 4 years ago
- ☆15Apr 1, 2020Updated 5 years ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆92Jul 4, 2024Updated last year
- ☆12Dec 12, 2017Updated 8 years ago
- Official repository of OS-FPI☆17Dec 22, 2024Updated last year
- ☆14Jul 8, 2024Updated last year
- [ICML2024]The official implementation of SemiRES in PyTorch.☆33Jun 20, 2024Updated last year
- Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts"☆104Mar 6, 2024Updated 2 years ago
- [ICML'24] Open-Vocabulary Calibration for Fine-tuned CLIP☆18Jun 14, 2024Updated last year
- A vision-language model with bidirectional progressive fusion and global-local alignment for enhanced medical image segmentation.☆17Dec 25, 2025Updated 2 months ago
- Starting from MICCAI 2021, all accepted papers and their corresponding reviews are publicly available on OpenReview. This repo contains s…☆58Oct 1, 2021Updated 4 years ago
- This is Pytorch implementation of our paper "LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition".☆11Sep 23, 2024Updated last year
- Early Accepted in MICCAI 2023☆12Jul 11, 2023Updated 2 years ago
- DPStyler: Dynamic PromptStyler for Source-Free Domain Generalization☆17Jul 26, 2024Updated last year
- SAM Adaptation using SVD☆12Jul 13, 2025Updated 8 months ago
- The official implementation of RAR☆91Dec 9, 2025Updated 3 months ago
- Official Code for Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning☆16Jul 24, 2025Updated 7 months ago
- [ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination☆32Oct 13, 2025Updated 5 months ago
- [ACM MM2024] The code for HMLLM.☆11Oct 27, 2024Updated last year