tingyaohsu / SciCap
SciCap Dataset
☆54 Updated 3 years ago
Alternatives and similar repositories for SciCap:
Users who are interested in SciCap are comparing it to the repositories listed below.
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021) ☆52 Updated 3 years ago
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark." ☆36 Updated last year
- Dataset introduced in PlotQA: Reasoning over Scientific Plots ☆72 Updated last year
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev… ☆31 Updated last month
- Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data" ☆42 Updated 3 months ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning ☆22 Updated 4 months ago
- Chart-to-Text: Generating Natural Language Explanations for Charts by Adapting the Transformer Model ☆151 Updated last year
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023) ☆85 Updated last year
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning ☆134 Updated last year
- Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering ☆49 Updated 2 years ago
- Source code for the paper "Prefix Language Models are Unified Modal Learners" ☆43 Updated last year
- LLM for Scientific Research Survey ☆34 Updated last week
- This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch" ☆38 Updated last year
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation. ☆77 Updated 2 months ago
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering" ☆79 Updated last year
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language" ☆63 Updated 2 years ago
- Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning" ☆23 Updated 7 months ago
- Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text ☆24 Updated 2 years ago
- [NAACL 2022] Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance Learning. ☆56 Updated 9 months ago
- Code and data for ImageCoDe, a contextual vision-and-language benchmark ☆39 Updated 10 months ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings ☆55 Updated 7 months ago