Baselines for the DocVQA dataset
☆21Apr 11, 2021Updated 4 years ago
Alternatives and similar repositories for DocVQA
Users who are interested in DocVQA are comparing it to the libraries listed below.
- Document Visual Question Answering☆130Jul 30, 2020Updated 5 years ago
- ☆188May 8, 2024Updated last year
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."☆36Mar 2, 2023Updated 3 years ago
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Nov 2, 2021Updated 4 years ago
- Vietnamese handwritten text recognition system☆18May 2, 2021Updated 4 years ago
- ☆52May 28, 2024Updated last year
- Unofficial implementation of the paper "MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition" by Bhunia et al. (2021).☆13Jun 22, 2022Updated 3 years ago
- ☆25Jun 25, 2021Updated 4 years ago
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…☆20Dec 4, 2024Updated last year
- Synthetic Dataset Generation: Recovering Homography from Camera Captured Documents☆20May 13, 2019Updated 6 years ago
- Publicly released code for the LAMBERT model☆105Jun 14, 2021Updated 4 years ago
- Detectron2 for Document Layout Analysis☆187Aug 2, 2024Updated last year
- From document (PDF) or document images to analysis ready semi-structured data.☆20Nov 4, 2022Updated 3 years ago
- Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features☆25Nov 15, 2021Updated 4 years ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- ☆18Apr 11, 2023Updated 2 years ago
- PaddlePaddle model encryption library☆10Nov 13, 2021Updated 4 years ago
- ☆10Oct 1, 2020Updated 5 years ago
- ☆11Mar 24, 2023Updated 3 years ago
- Multi-span Style Extraction for Generative Reading Comprehension☆10Apr 2, 2021Updated 4 years ago
- Official repository of the paper MPMQA: Multimodal Question Answering on Product Manuals (AAAI 2023)☆19Nov 28, 2022Updated 3 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆40Sep 15, 2022Updated 3 years ago
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆17Mar 2, 2020Updated 6 years ago
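- Scraper for the National Bureau of Statistics of China's five-level address data (province, city, county, township, village), http://www.stats.gov.cn/tjsj/tjbz/tjyqhdmhcxhfdm/2018/index.html☆12Jan 8, 2020Updated 6 years ago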
- ☆15May 26, 2021Updated 4 years ago
- AgriBrain is an agriculture and irrigation management system that integrates monitoring, analysis and automation into a single platform☆13Jun 30, 2019Updated 6 years ago
- Examples using MLX Swift☆13Apr 9, 2025Updated 11 months ago
- A large-scale infographics dataset from Visual.ly with metadata and additional crowdsourced annotations☆15Oct 8, 2018Updated 7 years ago
- DeVLBert: Learning Deconfounded Visio-Linguistic Representations☆27Nov 27, 2022Updated 3 years ago
- Predicting agriculture crop yields using climate modeling and deep learning.☆13Jun 2, 2017Updated 8 years ago
- EAST: An Efficient and Accurate Scene Text Detector☆15Jan 22, 2018Updated 8 years ago
- ☆10Aug 14, 2019Updated 6 years ago
- Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps [AAAI 2021]☆57Apr 5, 2022Updated 3 years ago
- Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP…☆10Nov 20, 2020Updated 5 years ago
- Tianchi Big Data Competition 2017: Guangdong Government Data Innovation Competition, Intelligent Algorithm Track☆10Apr 1, 2018Updated 7 years ago
- SIGIR paper Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback☆14Oct 17, 2022Updated 3 years ago
- ☆14Sep 23, 2025Updated 6 months ago
- The codebase for the paper: A Closer Look at How Fine-tuning Changes BERT☆22Apr 3, 2023Updated 2 years ago
- Table Recognition and Content Extraction in PDF Files☆23Apr 22, 2019Updated 6 years ago