showlab / CLVQA
[AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)
☆37Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for CLVQA
- VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)☆29Updated 7 months ago
- Compress conventional Vision-Language Pre-training data☆49Updated last year
- VisualGPTScore for visio-linguistic reasoning☆26Updated last year
- ☆19Updated last year
- ☆25Updated 9 months ago
- [ECCV2024] Learning Video Context as Interleaved Multimodal Sequences☆29Updated last month
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆31Updated last year
- ☆56Updated 2 years ago
- NegCLIP.☆26Updated last year
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning☆40Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆37Updated 10 months ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆32Updated 4 months ago
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆17Updated last month
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight☆35Updated last year
- Official code for the ICLR2023 paper Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection☆41Updated 5 months ago
- [CVPR2022] PyTorch re-implementation of Prompt Distribution Learning☆15Updated last year
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Updated 2 years ago
- 【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"☆17Updated last month
- ☆58Updated 2 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆22Updated 5 months ago
- code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720☆52Updated 5 months ago
- Temporal Alignment Representations with Contrastive Learning☆22Updated last year
- [NeurIPS'2023] Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models☆17Updated 11 months ago
- ☆21Updated last year
- [CVPR' 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆40Updated 3 months ago
- ☆17Updated last year
- ☆32Updated 7 months ago
- ☆60Updated last year
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆32Updated last year
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models☆44Updated last year