Official code for Fine-Grained Visual Prompting, NeurIPS 2023
☆54Feb 1, 2024Updated 2 years ago
Alternatives and similar repositories for FGVP
Users that are interested in FGVP are comparing it to the libraries listed below.
- ☆87Apr 15, 2022Updated 3 years ago
- [CVPR 2024] Official implementation of the paper "DePT: Decoupled Prompt Tuning"☆107Nov 24, 2025Updated 4 months ago
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆28Aug 19, 2024Updated last year
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"☆13Dec 1, 2024Updated last year
- ☆13Oct 25, 2024Updated last year
- [ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆17Sep 11, 2024Updated last year
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.☆24Jan 6, 2025Updated last year
- ☆19Nov 7, 2024Updated last year
- Dettoolchain: A new prompting paradigm to unleash detection ability of MLLM☆45Oct 12, 2024Updated last year
- ☆10Aug 31, 2023Updated 2 years ago
- RAMS-Trans: Recurrent Attention Multi-scale Transformer for Fine-grained Image Recognition☆11Dec 14, 2021Updated 4 years ago
- Official Pytorch implementation of "E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning". (ICCV2023)☆71Jan 19, 2024Updated 2 years ago
- ☆32Oct 6, 2024Updated last year
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"☆146Jun 20, 2024Updated last year
- ☆23Aug 20, 2024Updated last year
- Official implementation of "Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation"☆13Apr 15, 2024Updated last year
- The official implementation of InterBERT☆11Oct 18, 2022Updated 3 years ago
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆11Oct 9, 2024Updated last year
- [ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition☆16Jul 9, 2024Updated last year
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆39Mar 4, 2024Updated 2 years ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆166Nov 8, 2025Updated 4 months ago
- Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)☆44Jul 23, 2024Updated last year
- ☆25Jul 10, 2023Updated 2 years ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆92Jul 4, 2024Updated last year
- Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning (ICML 2023)☆19Dec 15, 2023Updated 2 years ago
- Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".☆129Nov 7, 2024Updated last year
- [ICCV 2025] Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models"☆54Feb 10, 2025Updated last year
- ☆22Dec 28, 2024Updated last year
- Official code for the NeurIPS 2023 paper "Switching Temporary Teachers for Semi-Supervised Semantic Segmentation"☆52Nov 16, 2023Updated 2 years ago
- The project is about predicting sets (of classes) from images.☆23Aug 31, 2021Updated 4 years ago
- ☆22Dec 9, 2022Updated 3 years ago
- ☆95Sep 23, 2023Updated 2 years ago
- ❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119☆1,213Sep 2, 2023Updated 2 years ago
- This repo holds the competitions (information, solutions, summaries, memories) that our team has participated in☆25Feb 4, 2024Updated 2 years ago
- [ICCV 2023] Code for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement"☆149Apr 21, 2024Updated last year
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆101Oct 29, 2025Updated 4 months ago
- Official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"☆233Jun 1, 2025Updated 9 months ago
- [TPAMI 2024] This is the official Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding"…☆28May 8, 2025Updated 10 months ago
- Official repository for the paper "Cross-modal Information Flow in Multimodal Large Language Models"☆42May 21, 2025Updated 10 months ago