mineshmathew / DocVQALinks
Baselines for the DocVQA dataset
☆21Updated 4 years ago
Alternatives and similar repositories for DocVQA
Users that are interested in DocVQA are comparing it to the libraries listed below
- Document Visual Question Answering☆127Updated 5 years ago
- Evaluation Tool for the ICDAR 2019 Competition on Table Detection and Recognition☆42Updated 3 years ago
- Publicly released code for the LAMBERT model☆103Updated 4 years ago
- ☆42Updated 4 years ago
- ☆107Updated 4 years ago
- Dataset Generation Code for: S.R. Qasim, H. Mahmood, and F. Shafait, Rethinking Table Parsing using Graph Neural Networks (2019)☆122Updated 5 years ago
- Implementation of Bidirectional Scene Text Recognition with a Single Decoder☆65Updated 11 months ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆27Updated 6 years ago
- Code for ICCV 2023 Paper: “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆54Updated 2 years ago
- ☆80Updated 2 years ago
- Project page for "Cross-Domain Document Object Detection: Benchmark Suite and Method, CVPR 2020"☆45Updated 5 years ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…☆53Updated last year
- ICDAR 2021 Competition on Scientific Literature Parsing☆35Updated 5 years ago
- Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps [AAAI2021]☆57Updated 3 years ago
- running LayoutLMv2☆11Updated 3 years ago
- OCR Annotations from Amazon Textract for Industry Documents Library☆103Updated 3 years ago
- An implementation of the Splitting and Merging table recognition method.☆79Updated 5 years ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆25Updated 4 years ago
- CTE: Contextualized Table Extraction Dataset☆17Updated 2 years ago
- ☆51Updated last year
- Project code for ACM MM2020 paper: "TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection"☆47Updated 2 years ago
- [NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting☆67Updated 2 years ago
- ☆87Updated 5 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 3 years ago
- The ICDAR 2019 cTDaR is to evaluate the performance of methods for table detection (TRACK A) and table recognition (TRACK B). For the fir…☆177Updated 3 years ago
- EATEN: Entity-aware Attention for Single Shot Visual Text Extraction☆181Updated 5 years ago
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…☆70Updated last year
- [CVPR 2019] "Handwriting Recognition in Low-resource Scripts using Adversarial Learning", IEEE Conf. on Computer Vision and Pattern Reco…☆63Updated 6 years ago
- GCN used for semi-structured document information extraction.☆22Updated 2 years ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30Updated 2 years ago