Distorted Document Images dataset (DDI-100).
☆146Nov 1, 2022Updated 3 years ago
Alternatives and similar repositories for DDI-100
Users that are interested in DDI-100 are comparing it to the libraries listed below
Sorting:
- Table structure recognition dataset of the paper: Complicated Table Structure Recognition☆379Jul 7, 2020Updated 5 years ago
- Document Rectification and Illumination Correction using a Patch-based CNN☆396Sep 28, 2022Updated 3 years ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆152Sep 17, 2025Updated 5 months ago
- CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks☆187Oct 17, 2023Updated 2 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI.☆203Mar 1, 2025Updated last year
- OCR Annotations from Amazon Textract for Industry Documents Library☆103Aug 20, 2022Updated 3 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis☆635Aug 12, 2024Updated last year
- Augment line images for improving OCR datasets☆10Oct 4, 2023Updated 2 years ago
- Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021☆572Jun 14, 2024Updated last year
- ☆31Dec 18, 2025Updated 2 months ago
- Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one …☆766Oct 5, 2023Updated 2 years ago
- Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)☆599Nov 10, 2024Updated last year
- A curated list of awesome synthetic data for text location and recognition☆338Jun 16, 2021Updated 4 years ago
- ☆1,040Jul 9, 2025Updated 7 months ago
- TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition☆102Dec 9, 2021Updated 4 years ago
- ☆478Jul 8, 2025Updated 7 months ago
- TableBank: A Benchmark Dataset for Table Detection and Recognition☆1,082Aug 12, 2024Updated last year
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…☆69Feb 24, 2024Updated 2 years ago
- Geometric Augmentation for Text Image☆492Apr 21, 2020Updated 5 years ago
- Pytorch implementation for "Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter".☆67Jun 15, 2021Updated 4 years ago
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆288Feb 13, 2023Updated 3 years ago
- Project page of SynthText3D☆149Dec 10, 2019Updated 6 years ago
- Code for: S.R. Qasim, H. Mahmood, and F. Shafait, Rethinking Table Recognition using Graph Neural Networks (2019)☆275Nov 22, 2022Updated 3 years ago
- ☆132Mar 24, 2023Updated 2 years ago
- Synthetic Dataset Generation: Recovering Homography from Camera Captured Documents☆20May 13, 2019Updated 6 years ago
- A PyTorch implementation of "R-Net: A Relationship Network for Efficient and Accurate Scene Text Detection” (TMM2021)☆62Jun 4, 2020Updated 5 years ago
- CORD: A Consolidated Receipt Dataset for Post-OCR Parsing☆465Jul 20, 2022Updated 3 years ago
- HHH☆36May 2, 2022Updated 3 years ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆27Apr 25, 2019Updated 6 years ago
- Detect textlines in document images☆91May 27, 2024Updated last year
- Data and implementation of ECCV2020 paper 'Adaptive Text Recognition through Visual Matching'☆124Nov 22, 2022Updated 3 years ago
- OCR dataset Text-Detection dataset Font-Classification dataset generator☆149Mar 1, 2022Updated 4 years ago
- The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.☆422Jun 18, 2025Updated 8 months ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30May 23, 2023Updated 2 years ago
- An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.☆107Nov 15, 2023Updated 2 years ago
- https://dl.acm.org/doi/10.1145/3657281☆97Apr 25, 2024Updated last year
- PyTorch implementation of STR models for transfer learning in Indic Languages☆16Sep 20, 2021Updated 4 years ago
- Code for the paper "UVDoc: Neural Grid-based Document Unwarping"☆198Jul 28, 2024Updated last year
- An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"☆81Oct 14, 2023Updated 2 years ago