ChenDelong1999 / ITRA
A codebase for flexible and efficient Image Text Representation Alignment
☆14Updated last year
Related projects: ⓘ
- Source code of paper "Remote Sensing Cross-Modal Image-Text Retrieval Based on Global and Local Information"☆56Updated 10 months ago
- A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval (MM'23 Oral)☆10Updated 9 months ago
- ☆18Updated last month
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆27Updated 8 months ago
- Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval, 2023☆11Updated 8 months ago
- 🎮 A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)| Remote Sensing Cross-Model Retrieval (R…☆36Updated 5 months ago
- [Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning☆46Updated 7 months ago
- Word4Per is an innovative framework for Zero-Shot Composed Person Retrieval (ZS-CPR), integrating visual and textual information for enha…☆13Updated 9 months ago
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆80Updated 9 months ago
- Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022))☆51Updated 10 months ago
- Code release for Proto-CLIP: Vision-Language Prototypical Network for Few-Shot Learning☆34Updated 3 months ago
- ☆12Updated last year
- Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images, TGRS 2024.☆20Updated last week
- Official Pytorch implementation of "E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning". (ICCV2023)☆65Updated 8 months ago
- The official implementation for paper: Revisting the Power of Prompt for Visual Tuning.☆10Updated 2 months ago
- [CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"☆63Updated last month
- [ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models☆48Updated 3 months ago
- Source code of the paper Fine-Grained Visual Classification via Internal Ensemble Learning Transformer☆39Updated 5 months ago
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆91Updated last year
- 🔥MDCS: More Diverse Experts with Consistency Self-distillation for Long-tailed Recognition [Official, ICCV 2023]☆25Updated 6 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆62Updated 7 months ago
- [AAAI2024] Official implementation of the AAAI 2024 paper TGP-T☆25Updated 5 months ago
- ☆85Updated 11 months ago
- About Code Release for "CLIPood: Generalizing CLIP to Out-of-Distributions" (ICML 2023), https://arxiv.org/abs/2302.00864☆58Updated last year
- The official implementation of paper Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model. If you find our code or paper use…☆40Updated last year
- Collection of Remote Sensing Vision-Language Models☆117Updated 4 months ago
- Official code repository for "Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning" (published at ICLR 202…☆48Updated last year
- This is the official code for "Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning"☆21Updated 3 months ago
- [IEEE TMM 2023] This is the Pytorch code for our paper "Neighborhood Contrastive Transformer for Change Captioning".☆12Updated last year
- Multi-label Image Recognition with Partial Labels (IJCV'24, ESWA'24, AAAI'22)☆34Updated 2 months ago