machine-intelligence-laboratory/DDI-100

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/machine-intelligence-laboratory/DDI-100)

machine-intelligence-laboratory / DDI-100

Distorted Document Images dataset (DDI-100).

☆147

Alternatives and similar repositories for DDI-100

Users that are interested in DDI-100 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Academic-Hammer / SciTSR
View on GitHub
Table structure recognition dataset of the paper: Complicated Table Structure Recognition
☆384Jul 7, 2020Updated 6 years ago
IBM / SynthTabNet
View on GitHub
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆154Sep 17, 2025Updated 10 months ago
xiaoyu258 / DocProj
View on GitHub
Document Rectification and Illumination Correction using a Patch-based CNN
☆397Sep 28, 2022Updated 3 years ago
furkanbiten / idl_data
View on GitHub
OCR Annotations from Amazon Textract for Industry Documents Library
☆103Aug 20, 2022Updated 3 years ago
clovaai / CLEval
View on GitHub
CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks
☆187Oct 17, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
doc-analysis / DocBank
View on GitHub
DocBank: A Benchmark Dataset for Document Layout Analysis
☆653Aug 12, 2024Updated last year
cs-chan / Total-Text-Dataset
View on GitHub
Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one …
☆772Oct 5, 2023Updated 2 years ago
cvlab-stonybrook / DewarpNet
View on GitHub
Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)
☆622Nov 10, 2024Updated last year
ibm-aur-nlp / PubTabNet
View on GitHub
☆484Jul 8, 2025Updated last year
maxnth / LineAug
View on GitHub
Augment line images for improving OCR datasets
☆10Oct 4, 2023Updated 2 years ago
Canjie-Luo / Text-Image-Augmentation
View on GitHub
Geometric Augmentation for Text Image
☆494Apr 21, 2020Updated 6 years ago
shahrukhqasim / TIES-2.0
View on GitHub
Code for: S.R. Qasim, H. Mahmood, and F. Shafait, Rethinking Table Recognition using Graph Neural Networks (2019)
☆276Nov 22, 2022Updated 3 years ago
ibm-aur-nlp / PubLayNet
View on GitHub
☆1,053Jul 9, 2025Updated last year
SCUT-DLVCLab / Document-AI-Recommendations
View on GitHub
Algorithms, papers, datasets, performance comparisons for Document AI.
☆209Mar 1, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
TianzhongSong / awesome-SynthText
View on GitHub
A curated list of awesome synthetic data for text location and recognition
☆337Jun 16, 2021Updated 5 years ago
clovaai / synthtiger
View on GitHub
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
☆579Jun 14, 2024Updated 2 years ago
doc-analysis / TableBank
View on GitHub
TableBank: A Benchmark Dataset for Table Detection and Recognition
☆1,080Aug 12, 2024Updated last year
Michael-Xiu / ICDAR-SROIE
View on GitHub
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
☆29Apr 25, 2019Updated 7 years ago
abdoelsayed2016 / TNCR_Dataset
View on GitHub
Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…
☆68Feb 24, 2024Updated 2 years ago
MhLiao / SynthText3D
View on GitHub
Project page of SynthText3D
☆150Dec 10, 2019Updated 6 years ago
Chuhanxx / FontAdaptor
View on GitHub
Data and implementation of ECCV2020 paper 'Adaptive Text Recognition through Visual Matching'
☆124Nov 22, 2022Updated 3 years ago
Wang-Tianwei / Implicit-feature-alignment
View on GitHub
Pytorch implementation for "Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter".
☆67Jun 15, 2021Updated 5 years ago
LegalDocumentProcessing / FIR_Dataset_ICDAR2023
View on GitHub
☆12Jun 11, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Nikolai10 / doc-homography-generator
View on GitHub
Synthetic Dataset Generation: Recovering Homography from Camera Captured Documents
☆20May 13, 2019Updated 7 years ago
Caiyuan-Zheng / Consistency_Regularization_STR
View on GitHub
It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.
☆28Jul 6, 2022Updated 4 years ago
xuewenyuan / TGRNet
View on GitHub
TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition
☆105Dec 9, 2021Updated 4 years ago
HCIILAB / M5HisDoc
View on GitHub
☆34Dec 18, 2025Updated 7 months ago
jpWang / LiLT
View on GitHub
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…
☆366Oct 31, 2022Updated 3 years ago
qurator-spk / sbb_textline_detection
View on GitHub
Detect textlines in document images
☆90May 27, 2024Updated 2 years ago
phamquiluan / jdeskew
View on GitHub
ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation
☆168May 5, 2026Updated 2 months ago
herobd / FUDGE
View on GitHub
Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"
☆33Mar 4, 2022Updated 4 years ago
shabie / docformer
View on GitHub
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…
☆290Feb 13, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
weijiawu / Polygon-free-Unconstrained-Scene-Text-Detection-with-Box-Annotations
View on GitHub
Unconstrained Text Detection with Box Supervisionand Dynamic Self-Training
☆34Nov 24, 2022Updated 3 years ago
BboyHanat / TextGenerator
View on GitHub
OCR dataset Text-Detection dataset Font-Classification dataset generator
☆149Mar 1, 2022Updated 4 years ago
Irene323 / GFTE
View on GitHub
A GCN-based table structure recognition method
☆226Mar 31, 2020Updated 6 years ago
wenwenyu / PICK-pytorch
View on GitHub
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP…
☆568Jul 25, 2024Updated 2 years ago
prohandler / GS-Bulk-Emails
View on GitHub
Google App Scripts that sends a number of emails from the specific number and that tracks the open status of each email
☆17Dec 11, 2024Updated last year
fh2019ustc / DocTr
View on GitHub
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
☆436Jul 10, 2026Updated 2 weeks ago
backtime92 / CRAFT-Reimplementation
View on GitHub
CRAFT-Pyotorch：Character Region Awareness for Text Detection Reimplementation for Pytorch
☆467Nov 18, 2021Updated 4 years ago