mineshmathew / DocVQA
baselines for DocVQA dataset
☆21Updated 3 years ago
Alternatives and similar repositories for DocVQA:
Users that are interested in DocVQA are comparing it to the libraries listed below
- Document Visual Question Answering☆115Updated 4 years ago
- Publicly released code for the LAMBERT model☆103Updated 3 years ago
- Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆52Updated last year
- An implementation of the Splitting and Merging table recognition method.☆78Updated 5 years ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…☆54Updated last year
- OCR Annotations from Amazon Textract for Industry Documents Library☆102Updated 2 years ago
- Project page for "Cross-Domain Document Object Detection: Benchmark Suite and Method, CVPR 2020"☆45Updated 4 years ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆21Updated 3 years ago
- ICDAR 2021 Competition on Scientific Literature Parsing☆34Updated 4 years ago
- Evaluation Tool for the ICDAR 2019 Competition on Table Detection and Recognition☆41Updated 2 years ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆28Updated 5 years ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆23Updated 4 years ago
- CTE: Contextualized Table Extraction Dataset☆17Updated 2 years ago
- ☆104Updated 4 years ago
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆55Updated this week
- Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]☆57Updated 2 years ago
- Implementation of Bidirectional Scene Text Recognition with a Single Decoder☆67Updated 4 months ago
- ☆41Updated 4 years ago
- Dataset Generation Code for: S.R. Qasim, H. Mahmood, and F. Shafait, Rethinking Table Parsing using Graph Neural Networks (2019)☆118Updated 4 years ago
- Key Information Extraction From Documents: Evaluation And Generator☆20Updated 4 years ago
- Official implementation for Dessurt☆58Updated 2 years ago
- ☆50Updated 10 months ago
- running LayoutLMv2☆11Updated 2 years ago
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…☆67Updated last year
- ReS2TIM: Reconstruct Syntactic Structures from Table Images☆22Updated 4 years ago
- Project code for ACM MM2020 paper: "TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection"☆47Updated last year
- [NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting☆67Updated last year
- ☆87Updated 5 years ago
- ☆23Updated last year
- Pyramid Mask Text Detector designed by SenseTime Video Intelligence Research team.☆49Updated 5 years ago