kdavila / ChartInfo_annotation_tools
Release for CHART annotation tools used for ICDAR CHART 2019 competition
☆25Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ChartInfo_annotation_tools
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆51Updated 3 years ago
- source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT☆73Updated last year
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."☆35Updated last year
- running LayoutLMv2☆11Updated 2 years ago
- DVQA Dataset: A Bar chart question answering dataset presented at CVPR 2018☆32Updated 5 years ago
- Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]☆56Updated 2 years ago
- SciCap Dataset☆48Updated 3 years ago
- Data of ACL 2019 Paper "Expressing Visual Relationships via Language".☆62Updated 4 years ago
- ☆50Updated 5 months ago
- Scene Text Aware Cross Modal Retrieval (StacMR)☆24Updated 3 years ago
- [EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers…☆20Updated 2 years ago
- ☆26Updated 5 years ago
- Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.☆16Updated last year
- ☆37Updated 3 years ago
- ☆44Updated 3 years ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆78Updated last year
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆45Updated last month
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models ”☆17Updated last year
- ☆129Updated last year
- CTE: Contextualized Table Extraction Dataset☆17Updated last year
- [AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps☆24Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆91Updated 2 months ago
- Synthetic Dataset used in the ICDAR2019 Competition on HArvesting Raw Tables from Infographics (CHART-Infographics)☆19Updated 5 years ago
- Research code for "Training Vision-Language Transformers from Captions Alone"☆33Updated 2 years ago
- baselines for DocVQA dataset☆20Updated 3 years ago
- Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.☆63Updated 3 years ago
- Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER…☆119Updated 3 years ago
- A modular framework for Visual Question Answering research by the FAIR A-STAR team☆45Updated 3 years ago
- ☆34Updated last year