mineshmathew / DocVQA
Baselines for the DocVQA dataset
☆21 Updated 4 years ago
Alternatives and similar repositories for DocVQA
Users interested in DocVQA are comparing it to the repositories listed below
- Document Visual Question Answering ☆120 Updated 4 years ago
- Publicly released code for the LAMBERT model ☆103 Updated 4 years ago
- Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction” ☆53 Updated last year
- ☆42 Updated 4 years ago
- Evaluation Tool for the ICDAR 2019 Competition on Table Detection and Recognition ☆41 Updated 3 years ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat… ☆53 Updated last year
- Dataset Generation Code for: S.R. Qasim, H. Mahmood, and F. Shafait, Rethinking Table Parsing using Graph Neural Networks (2019) ☆122 Updated 4 years ago
- Implementation of Bidirectional Scene Text Recognition with a Single Decoder ☆66 Updated 7 months ago
- ☆104 Updated 4 years ago
- ICDAR 2021 Competition on Scientific Literature Parsing ☆34 Updated 4 years ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format. ☆24 Updated 4 years ago
- An implementation of the Splitting and Merging table recognition method. ☆79 Updated 5 years ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context) ☆21 Updated 4 years ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction ☆27 Updated 6 years ago
- OCR Annotations from Amazon Textract for Industry Documents Library ☆102 Updated 2 years ago
- This repo contains the code for "Table Structure Extraction with Bi-directional Gated Recurrent Unit Networks", ICDAR 2019. ☆19 Updated last year
- The imdb files with SBD-Trans OCR for TextVQA dataset. ☆11 Updated 3 years ago
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer ☆60 Updated 2 years ago
- ☆26 Updated 2 years ago
- ☆80 Updated 2 years ago
- Project code for ACM MM2020 paper: "TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection" ☆47 Updated last year
- Key Information Extraction From Documents: Evaluation And Generator ☆20 Updated 4 years ago
- Project page for "Cross-Domain Document Object Detection: Benchmark Suite and Method, CVPR 2020" ☆45 Updated 4 years ago
- Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps [AAAI2021] ☆57 Updated 3 years ago
- Synthetic Dataset used in the ICDAR2019 Competition on HArvesting Raw Tables from Infographics (CHART-Infographics) ☆20 Updated 6 years ago
- RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering ☆10 Updated 2 years ago
- [NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting ☆68 Updated last year
- Release for CHART annotation tools used for ICDAR CHART 2019 competition ☆27 Updated last year
- EATEN: Entity-aware Attention for Single Shot Visual Text Extraction ☆180 Updated 5 years ago
- A modular framework for Visual Question Answering research by the FAIR A-STAR team ☆45 Updated 3 years ago