injadlu / VCR
☆13 Updated 2 months ago
Alternatives and similar repositories for VCR:
Users who are interested in VCR are comparing it to the libraries listed below.
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models ☆46 Updated 10 months ago
- CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning ☆14 Updated last year
- [AAAI'25, CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models". ☆104 Updated 4 months ago
- Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts" ☆101 Updated last year
- ☆23 Updated last year
- [NeurIPS 2024] Code for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models ☆40 Updated last month
- ☆46 Updated last year
- AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval ☆19 Updated 8 months ago
- The official pytorch implementation of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models". ☆66 Updated 2 weeks ago
- Official PyTorch Code for "ATPrompt: Textual Prompt Learning with Embedded Attributes" ☆33 Updated 4 months ago
- 🔥MDCS: More Diverse Experts with Consistency Self-distillation for Long-tailed Recognition [Official, ICCV 2023] ☆30 Updated 6 months ago
- [ICLR 2025] Official Implementation of Local-Prompt: Extensible Local Prompts for Few-Shot Out-of-Distribution Detection ☆27 Updated 3 weeks ago
- [ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation ☆35 Updated 7 months ago
- (CVPR2023) official code of Decompose, Adjust, Compose: Effective Normalization by Playing with Frequency for Domain Generalization ☆30 Updated last year
- cliptrase ☆36 Updated 8 months ago
- [TPAMI 2025] Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection ☆20 Updated this week
- Instruction Tuning in Continual Learning paradigm ☆47 Updated 3 months ago
- [ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models ☆17 Updated 8 months ago
- Easy wrapper for inserting LoRA layers in CLIP. ☆31 Updated 10 months ago
- [ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition ☆14 Updated 10 months ago
- ICML-2024 highlight paper "Realistic Unsupervised CLIP Fine-tuning with Universal Entropy Optimization" ☆15 Updated 9 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models ☆72 Updated 10 months ago
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models" ☆51 Updated 8 months ago
- GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery (CVPR2025) ☆18 Updated last month
- Official code for ICLR 2024 paper, "A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation" ☆78 Updated last year
- The download methods of Vision-language Continual Pretraining Dataset P9D. ☆11 Updated 4 months ago
- [CVPR 2023] Diversity-Aware Meta Visual Prompting ☆81 Updated last year
- Multimodal-Composite-Editing-and-Retrieval-update ☆32 Updated 6 months ago
- The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025] ☆37 Updated last month
- ☆15 Updated last year