ivattyue / SC-Tune
Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"
☆16 Updated 4 months ago
Related projects:
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024) ☆62 Updated 7 months ago
- [CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models". ☆82 Updated last month
- [CVPR 2023] Diversity-Aware Meta Visual Prompting ☆73 Updated 9 months ago
- Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs". ☆39 Updated 3 weeks ago
- ☆81 Updated 9 months ago
- Task Residual for Tuning Vision-Language Models (CVPR 2023) ☆65 Updated last year
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models ☆23 Updated 2 months ago
- Official PyTorch implementation of the paper "Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner" ☆15 Updated last year
- [CVPR 2024] Official implementation of the paper "DePT: Decoupled Prompt Tuning" ☆63 Updated last month
- This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with H… ☆17 Updated 4 months ago
- [ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation ☆27 Updated 3 weeks ago
- [AAAI2024] Official implementation of the AAAI 2024 paper TGP-T ☆25 Updated 5 months ago
- Composed Video Retrieval ☆42 Updated 4 months ago
- ☆85 Updated 11 months ago
- Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts" ☆88 Updated 6 months ago
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization ☆93 Updated 7 months ago
- [CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models ☆58 Updated last week
- [NeurIPS 2023] Generalized Logit Adjustment ☆33 Updated 5 months ago
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23] ☆91 Updated last year
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati… ☆63 Updated 3 months ago
- Code for paper "AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention" ☆13 Updated 2 months ago
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models" ☆38 Updated 2 weeks ago
- cliptrase ☆15 Updated 2 weeks ago
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training" ☆23 Updated 6 months ago
- [ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F… ☆219 Updated 11 months ago
- [ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models ☆48 Updated 3 months ago
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models ☆137 Updated 9 months ago
- ☆31 Updated last year
- [BMVC 2023] Zero-shot Composed Text-Image Retrieval ☆42 Updated last year
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation ☆26 Updated 6 months ago