ZiyuGuo99 / CALIP
[AAAI 2023] Zero-Shot Enhancement of CLIP with Parameter-free Attention
☆84Updated last year
Related projects ⓘ
Alternatives and complementary repositories for CALIP
- [CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection☆53Updated 3 months ago
- GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?☆207Updated 5 months ago
- [ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models☆150Updated 3 months ago
- [ICPR'24 Oral] Large-scale Pre-trained Models are Surprisingly Strong in Incremental Novel Class Discovery☆24Updated 4 months ago
- Learning Semantic Relationship among Instances for Image-Text Matching, CVPR, 2023☆88Updated last year
- Official implementation of "Towards Efficient Visual Adaption via Structural Re-parameterization".☆197Updated 6 months ago
- [CVPR 2023] Diversity-Aware Meta Visual Prompting☆77Updated 11 months ago
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆94Updated last year
- ☆89Updated last year
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆65Updated last year
- (ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator☆108Updated 3 weeks ago
- FACTUAL benchmark dataset, the pre-trained textual scene graph parser trained on FACTUAL.☆102Updated this week
- Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, CVPR, 2024☆80Updated 4 months ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆40Updated last year
- Official PyTorch implementation of MOOD series: (1) MOODv1: Rethinking Out-of-distributionDetection: Masked Image Modeling Is All You Ne…☆133Updated 4 months ago
- Cross-modal few-shot adaptation with CLIP☆315Updated 7 months ago
- [ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.☆81Updated 3 weeks ago
- ☆30Updated 6 months ago
- [CVPR' 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆40Updated 3 months ago
- [ICCV 2023] Code for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement"☆139Updated 6 months ago
- The official implementation of paper Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model. If you find our code or paper use…☆43Updated last year
- 【ICCV 2023】Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning☆56Updated 3 months ago
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆145Updated 10 months ago
- Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models☆86Updated 7 months ago
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding☆28Updated 2 weeks ago
- [ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models☆56Updated 5 months ago
- [CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"☆73Updated 3 months ago
- Code and results accompanying our paper titled CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets☆54Updated last year
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023☆51Updated last year
- Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts"☆93Updated 8 months ago