sparkfish / shabby-pagesLinks

ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for use in training models to reverse distortions and recover to original denoised documents.

☆59

Alternatives and similar repositories for shabby-pages

Users that are interested in shabby-pages are comparing it to the libraries listed below

Sorting:

xiaomore / Document-Image-Dewarping
☆57Updated last year
phamquiluan / jdeskew
ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation
☆142Updated 2 months ago
dali92002 / DocEnTR
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022
☆167Updated 6 months ago
ayanban011 / SwinDocSegmenter
[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
☆73Updated 10 months ago
FelixHertlein / inv3d-model
Inference, training and evaluation code for our models from the paper "Inv3D: a high-resolution 3D invoice dataset for template-guided si…
☆50Updated last year
fh2019ustc / DocGeoNet
The official code for “Geometric Representation Learning for Document Image Rectification”, ECCV, 2022.
☆85Updated last month
cxfyxl / VIPTR
☆41Updated last year
arvindrajan92 / DTrOCR
A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition
☆181Updated last month
ZZZHANG-jx / GCDRNet
[TAI 2023] Appearance Enhancement for Camera-captured Document Images in the Wild
☆41Updated last year
czczup / FAST
Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
☆203Updated 2 months ago
TomStog / curved-text-alignment
A function that takes as input a cropped text line image, and outputs the dewarped image.
☆20Updated 9 months ago
DVLP-CMATERJU / RectiNet
A Gated and Bifurcated Stacked U-Net Module for Document Image Dewarping
☆106Updated 2 years ago
dmitrijsk / AttentionHTR
Attention-based sequence-to-sequence model for handwritten word recognition
☆61Updated 10 months ago
lmmx / page-dewarp
Document image dewarping library using a cubic sheet model
☆166Updated this week
thomasjhuang / deep-learning-for-document-dewarping
An application of high resolution GANs to dewarp images of perturbed documents
☆142Updated 3 years ago
andreagemelli / doc2graph
Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.
☆129Updated 2 years ago
gwxie / Dewarping-Document-Image-By-Displacement-Flow-Estimation
Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network
☆179Updated 2 years ago
chongzhangFDU / ROOR
This is the official implementation to the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Docume…
☆26Updated 8 months ago
facebookresearch / MultiplexedOCR
Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR
☆80Updated 2 years ago
tanguymagne / UVDoc
Code for the paper "UVDoc: Neural Grid-based Document Unwarping"
☆153Updated last year
fh2019ustc / DocTr
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
☆398Updated last month
qurator-spk / sbb_binarization
Document Image Binarization
☆77Updated 9 months ago
kartikgill / Easter2
Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION
☆79Updated 2 years ago
ZeningLin / PEneo
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
☆36Updated 4 months ago
georgeretsi / HTR-best-practices
Basic HTR concepts/modules to boost performance
☆32Updated 8 months ago
machine-intelligence-laboratory / DDI-100
Distorted Document Images dataset (DDI-100).
☆139Updated 2 years ago
cvlab-stonybrook / PaperEdge
The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022)
☆131Updated last year
poloclub / tsr-convstem
High-Performance Transformers for Table Structure Recognition Need Early Convolutions
☆45Updated last year
MaxKinny / TabRecSet
A large scale camera-taken table detection and recognition dataset.
☆136Updated 2 weeks ago
FelixHertlein / doc-matcher
Project page for the WACV 2025 Paper "DocMatcher: Document Image Dewarping via Structural and Textual Line Matching".
☆29Updated last month