mineshmathew / DocVQALinks
baselines for DocVQA dataset
☆21Updated 4 years ago
Alternatives and similar repositories for DocVQA
Users that are interested in DocVQA are comparing it to the libraries listed below
Sorting:
- Document Visual Question Answering☆131Updated 5 years ago
- Publicly released code for the LAMBERT model☆104Updated 4 years ago
- Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]☆57Updated 3 years ago
- Evaluation Tool for the ICDAR 2019 Competition on Table Detection and Recognition☆42Updated 3 years ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆27Updated 6 years ago
- Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆54Updated 2 years ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆25Updated 4 years ago
- ☆81Updated 2 years ago
- ☆108Updated 4 years ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…☆53Updated 2 years ago
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆57Updated 10 months ago
- ☆51Updated last year
- OCR Annotations from Amazon Textract for Industry Documents Library☆103Updated 3 years ago
- ☆42Updated 5 years ago
- CTE: Contextualized Table Extraction Dataset☆17Updated 2 years ago
- Project page for "Cross-Domain Document Object Detection: Benchmark Suite and Method, CVPR 2020"☆46Updated 5 years ago
- ICDAR 2021 Competition on Scientific Literature Parsing☆35Updated 5 years ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆22Updated 4 years ago
- ☆22Updated 4 years ago
- Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answer…☆55Updated last year
- ☆40Updated 4 years ago
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…☆69Updated last year
- ☆25Updated 3 years ago
- Implementation of Bidirectional Scene Text Recognition with a Single Decoder☆65Updated last year
- ☆188Updated last year
- The ICDAR 2019 cTDaR is to evaluate the performance of methods for table detection (TRACK A) and table recognition (TRACK B). For the fir…☆178Updated 3 years ago
- Evaluation of the Layoutlm model on the CORD dataset☆32Updated 4 years ago
- ☆87Updated 5 years ago
- The imdb files with SBD-Trans OCR for TextVQA dataset.☆11Updated 4 years ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30Updated 2 years ago