LivingSkyTechnologies / Dense_Article_Dataset_DAD
Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis
☆15Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Dense_Article_Dataset_DAD
- Repository to use/train segmentation models for document layout analysis☆19Updated 2 years ago
- ☆74Updated 2 years ago
- CVPR 2022: Table Structure Recognition☆39Updated 2 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 2 years ago
- ☆37Updated 3 years ago
- Form images from U.S. National Archives annotated with text bounding boxes, classes, relationships, and transcription.☆35Updated 2 years ago
- CTE: Contextualized Table Extraction Dataset☆17Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆91Updated 2 months ago
- ☆55Updated 3 years ago
- Pytorch Implementation of Chargrid Paper (https://arxiv.org/abs/1809.08799)☆27Updated 2 years ago
- ☆23Updated 3 years ago
- Key Information Extraction From Documents: Evaluation And Generator☆19Updated 3 years ago
- Publicly released code for the LAMBERT model☆102Updated 3 years ago
- ICDAR 2021 Competition on Scientific Literature Parsing☆34Updated 4 years ago
- OCR & Ground Truth Resources☆74Updated 2 years ago
- an unofficial code for augment-XY-CUT in XYLayoutLM☆25Updated 2 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 3 years ago
- ☆15Updated 4 years ago
- Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"☆35Updated last year
- Implementation of BertGrid : https://arxiv.org/abs/1909.04948☆30Updated 6 months ago
- ☆18Updated last year
- Extraction of meaningful instances from document images with a Chargrid model☆34Updated 3 years ago
- Official implementation for Dessurt☆56Updated last year
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆49Updated 2 years ago
- DocILE: Document Information Localization and Extraction Benchmark☆117Updated 5 months ago
- Code for: U. Khan, S. Zahid, M.A. Ali, A. Ul-Hasan and F. Shafait, TabAug: Data Driven Augmentation for Enhanced Table Structure Recognit…☆7Updated 3 years ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆21Updated 3 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆35Updated 11 months ago
- ☆41Updated 2 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Updated 4 years ago