GaryJiajia / OFv2_ICL_VQAView external linksLinks
[CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering
☆21May 28, 2025Updated 8 months ago
Alternatives and similar repositories for OFv2_ICL_VQA
Users that are interested in OFv2_ICL_VQA are comparing it to the libraries listed below
Sorting:
- Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…☆10Jun 16, 2024Updated last year
- We present a new method for long-tailed out-of-distribution detection☆16Jan 20, 2025Updated last year
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Mar 1, 2024Updated last year
- ☆17Mar 13, 2023Updated 2 years ago
- Using image captions with LLM for zero-shot VQA☆18Mar 14, 2024Updated last year
- 【NeurIPS 2024】The implementation of LIVE: Learnable In-Context Vector for Visual Question Answering https://arxiv.org/abs/2406.13185☆22May 31, 2025Updated 8 months ago
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21May 8, 2023Updated 2 years ago
- Repo for the EMNLP 2023 paper "A Simple Knowledge-Based Visual Question Answering"☆25Dec 14, 2023Updated 2 years ago
- visual question answering prompting recipes for large vision-language models☆28Sep 14, 2024Updated last year
- Local self-attention in Transformer for visual question answering☆13Mar 17, 2024Updated last year
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples☆40Nov 27, 2024Updated last year
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆41Mar 23, 2024Updated last year
- Source code for the paper "Memory-Efficient Fine-Tuning via Low-Rank Activation Compression"☆13Aug 1, 2025Updated 6 months ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆44Sep 24, 2024Updated last year
- Goal of this project is to build Classification Decision Trees and Regression Decision trees without using any Machine learning libraries☆10Dec 28, 2018Updated 7 years ago
- ☆11Jul 24, 2024Updated last year
- ☆19Mar 10, 2025Updated 11 months ago
- ☆10Nov 12, 2024Updated last year
- General AI evaluation and Gauge Engine. A unified evaluation engine for LLMs, MLLMs, audio, and diffusion models.☆40Updated this week
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆19Jul 3, 2025Updated 7 months ago
- VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)☆44Mar 28, 2024Updated last year
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- ☆13Aug 17, 2022Updated 3 years ago
- 云任务调度仿真平台☆12Mar 11, 2020Updated 5 years ago
- ☆10Jul 11, 2022Updated 3 years ago
- SEU Summer School project, based on Kotlin and Java.☆13Sep 15, 2023Updated 2 years ago
- [NeurIPS 2024] Official code for the paper 'RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier'☆14Aug 22, 2025Updated 5 months ago
- ☆12Apr 25, 2025Updated 9 months ago
- ☆11Apr 10, 2024Updated last year
- Library for automatic time series forecasting based on ARIMA models☆12May 14, 2017Updated 8 years ago
- ☆12Dec 20, 2024Updated last year
- ☆17Dec 23, 2025Updated last month
- ☆10May 4, 2018Updated 7 years ago
- [CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks"☆24Jun 8, 2025Updated 8 months ago
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"☆13Jun 1, 2022Updated 3 years ago
- [IJCAI'25 Workshop Oral] The 1st place solution of IJCAI 2025 challenge track 1: Image Detection and Localization☆33Dec 4, 2025Updated 2 months ago
- [ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering☆13Nov 23, 2022Updated 3 years ago
- Classify image and text with ResNet and BERT models using Pytorch☆13Jul 7, 2020Updated 5 years ago
- Code for paper: Unraveling the Shift of Visual Information Flow in MLLMs: From Phased Interaction to Efficient Inference☆13Jun 7, 2025Updated 8 months ago