LivingSkyTechnologies / Dense_Article_Dataset_DAD
Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis
☆15Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Dense_Article_Dataset_DAD
- Repository to use/train segmentation models for document layout analysis☆19Updated 2 years ago
- ☆23Updated 3 years ago
- ☆75Updated 2 years ago
- ☆55Updated 3 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 2 years ago
- Pytorch Implementation of Chargrid Paper (https://arxiv.org/abs/1809.08799)☆27Updated 2 years ago
- CVPR 2022: Table Structure Recognition☆39Updated 2 years ago
- CTE: Contextualized Table Extraction Dataset☆17Updated last year
- ☆37Updated 3 years ago
- Publicly released code for the LAMBERT model☆102Updated 3 years ago
- Key Information Extraction From Documents: Evaluation And Generator☆20Updated 3 years ago
- Extraction of meaningful instances from document images with a Chargrid model☆34Updated 3 years ago
- OCR & Ground Truth Resources☆74Updated 2 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆91Updated 2 months ago
- ICDAR 2021 Competition on Scientific Literature Parsing☆34Updated 4 years ago
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆49Updated 2 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Updated 4 years ago
- ☆18Updated last year
- ☆15Updated 4 years ago
- an unofficial code for augment-XY-CUT in XYLayoutLM☆25Updated 2 years ago
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…☆65Updated 8 months ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆35Updated 11 months ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆21Updated 3 years ago
- ☆9Updated 3 years ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆74Updated last year
- DocILE: Document Information Localization and Extraction Benchmark☆118Updated 6 months ago
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆116Updated last year
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆28Updated 5 years ago
- Code for: U. Khan, S. Zahid, M.A. Ali, A. Ul-Hasan and F. Shafait, TabAug: Data Driven Augmentation for Enhanced Table Structure Recognit…☆7Updated 3 years ago
- Evaluation of the Layoutlm model on the CORD dataset☆32Updated 2 years ago