LeapLabTHU / Cross-Modal-Adapter
[arXiv] Cross-Modal Adapter for Text-Video Retrieval
☆55Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Cross-Modal-Adapter
- ☆83Updated 11 months ago
- Official implementation of Dynamic Perceiver☆41Updated last year
- [ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning☆25Updated last month
- 📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)☆48Updated last year
- [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding☆144Updated 4 months ago
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆94Updated last year
- VisualGPTScore for visio-linguistic reasoning☆26Updated last year
- This repo is the official implementation of UPL (Unsupervised Prompt Learning for Vision-Language Models).☆106Updated 2 years ago
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models☆45Updated last year
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆108Updated last year
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …☆69Updated 9 months ago
- [ICCV 2023] Prompt-aligned Gradient for Prompt Tuning☆151Updated last year
- [CVPR' 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆41Updated 3 months ago
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆66Updated last year
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆146Updated 11 months ago
- Compress conventional Vision-Language Pre-training data☆49Updated last year
- NegCLIP.☆26Updated last year
- HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)☆41Updated 4 months ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆37Updated 11 months ago
- ☆56Updated 2 years ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆78Updated 8 months ago
- Instruction Tuning in Continual Learning paradigm☆26Updated 4 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆64Updated last month
- ☆27Updated last year
- Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".☆56Updated last year
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆38Updated 7 months ago
- ☆36Updated 7 months ago
- Toolkit for Elevater Benchmark☆67Updated last year
- ☆22Updated last year
- [ECCV2024] Learning Video Context as Interleaved Multimodal Sequences☆29Updated last month