vis-nlp / OpenCQA
☆11Updated last year
Related projects
Alternatives and complementary repositories for OpenCQA
- ☆106Updated 4 months ago
- SciCap Dataset☆49Updated 3 years ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆70Updated last year
- ☆33Updated 6 months ago
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…☆27Updated 10 months ago
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆17Updated 4 years ago
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."☆35Updated last year
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆43Updated 5 months ago
- Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"☆23Updated 5 months ago
- ☆28Updated last year
- EMNLP 2023 - InfoSeek: A New VQA Benchmark Focused on Visual Info-Seeking Questions☆16Updated 5 months ago
- PyTorch implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.☆16Updated last year
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆67Updated 4 months ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆19Updated 2 months ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆62Updated 7 months ago
- ☆64Updated 3 months ago
- ☆46Updated last month
- ☆113Updated 2 years ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆78Updated last year
- ☆16Updated last year
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆84Updated 2 months ago
- ☆171Updated 4 months ago
- Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data"☆41Updated last month
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆51Updated 3 years ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆57Updated 4 months ago
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"☆61Updated 2 years ago
- Code and data for ImageCoDe, a contextual vision-and-language benchmark☆39Updated 8 months ago
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆26Updated 4 months ago
- ☆54Updated 10 months ago
- Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)☆110Updated last month