mineshmathew / DocVQALinks
baselines for DocVQA dataset
☆21Updated 4 years ago
Alternatives and similar repositories for DocVQA
Users that are interested in DocVQA are comparing it to the libraries listed below
Sorting:
- Document Visual Question Answering☆125Updated 5 years ago
- Publicly released code for the LAMBERT model☆103Updated 4 years ago
- Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆54Updated 2 years ago
- Evaluation Tool for the ICDAR 2019 Competition on Table Detection and Recognition☆42Updated 3 years ago
- ☆80Updated 2 years ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…☆53Updated last year
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆27Updated 6 years ago
- ICDAR 2021 Competition on Scientific Literature Parsing☆35Updated 5 years ago
- ☆107Updated 4 years ago
- Implementation of Bidirectional Scene Text Recognition with a Single Decoder☆66Updated 10 months ago
- Dataset Generation Code for: S.R. Qasim, H. Mahmood, and F. Shafait, Rethinking Table Parsing using Graph Neural Networks (2019)☆123Updated 5 years ago
- ☆42Updated 4 years ago
- Project page for "Cross-Domain Document Object Detection: Benchmark Suite and Method, CVPR 2020"☆45Updated 4 years ago
- Key Information Extraction From Documents: Evaluation And Generator☆20Updated 4 years ago
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer☆62Updated 2 years ago
- OCR Annotations from Amazon Textract for Industry Documents Library☆103Updated 3 years ago
- Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]☆57Updated 3 years ago
- ☆51Updated last year
- ☆40Updated 4 years ago
- time-series row column classification☆14Updated 3 years ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆25Updated 4 years ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30Updated 2 years ago
- An implementation of the Splitting and Merging table recognition method.☆79Updated 5 years ago
- [NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting☆67Updated 2 years ago
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…☆70Updated last year
- Project code for ACM MM2020 paper: "TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection"☆47Updated 2 years ago
- ☆25Updated 5 years ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆22Updated 4 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 3 years ago
- This is an OCR solution for receipts, invoices, etc.☆20Updated 5 years ago