DocILE: Document Information Localization and Extraction Benchmark
☆142 · May 15, 2024 · Updated last year
Alternatives and similar repositories for docile
Users that are interested in docile are comparing it to the libraries listed below
- Contrast-guided Feature Adjustment Module for Visual Information Extraction ☆30 · May 23, 2023 · Updated 2 years ago
- ☆45 · Jul 18, 2022 · Updated 3 years ago
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer ☆62 · Jan 11, 2023 · Updated 3 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan… ☆362 · Oct 31, 2022 · Updated 3 years ago
- ReS2TIM: Reconstruct Syntactic Structures from Table Images ☆23 · Sep 10, 2020 · Updated 5 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating. ☆203 · Mar 1, 2025 · Updated last year
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer ☆78 · Apr 9, 2024 · Updated last year
- Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks. ☆137 · Oct 18, 2025 · Updated 4 months ago
- ☆81 · Jun 12, 2023 · Updated 2 years ago
- Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction” ☆54 · Aug 8, 2023 · Updated 2 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing" ☆33 · Mar 4, 2022 · Updated 4 years ago
- A curated list of resources for Document Understanding (DU) topic ☆1,501 · Jun 2, 2023 · Updated 2 years ago
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task… ☆288 · Feb 13, 2023 · Updated 3 years ago
- An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP. ☆107 · Nov 15, 2023 · Updated 2 years ago
- CTE: Contextualized Table Extraction Dataset ☆17 · Feb 23, 2023 · Updated 3 years ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas… ☆80 · Feb 8, 2023 · Updated 3 years ago
- ☆22 · May 5, 2021 · Updated 4 years ago
- MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition ☆103 · May 30, 2024 · Updated last year
- https://dl.acm.org/doi/10.1145/3657281 ☆97 · Apr 25, 2024 · Updated last year
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat… ☆53 · Jan 9, 2024 · Updated 2 years ago
- Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP… ☆569 · Jul 25, 2024 · Updated last year
- Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021 ☆572 · Jun 14, 2024 · Updated last year
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation ☆75 · Sep 12, 2024 · Updated last year
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files ☆152 · Sep 17, 2025 · Updated 5 months ago
- Research papers and code on information extraction from image/pdf ☆97 · Nov 25, 2022 · Updated 3 years ago
- ☆69 · Jan 9, 2024 · Updated 2 years ago
- Graph Key Information Extraction: GKIE ☆11 · Sep 15, 2022 · Updated 3 years ago
- Tool to parse wiki tables from the HTML dump of Wikipedia ☆11 · Jun 12, 2022 · Updated 3 years ago
- ☆108 · Feb 16, 2021 · Updated 5 years ago
- ☆132 · Mar 24, 2023 · Updated 2 years ago
- OCR toolbox from Davar-Lab ☆759 · Nov 16, 2023 · Updated 2 years ago
- A curated list of resources dedicated to table recognition ☆406 · Dec 12, 2024 · Updated last year
- ☆51 · May 28, 2024 · Updated last year
- ☆142 · Feb 13, 2024 · Updated 2 years ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions ☆44 · Apr 3, 2024 · Updated last year
- Example codebase for fine-tuning layoutLMv3 on DocVQA ☆52 · Sep 19, 2022 · Updated 3 years ago
- A curated list of papers about key information extraction. ☆105 · Dec 18, 2024 · Updated last year
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024) ☆162 · May 31, 2024 · Updated last year
- GCN used for semi-structured document information extraction. ☆21 · Aug 5, 2023 · Updated 2 years ago