PedroBarcha / old-books-datasetLinks
Old book pages (with groundtruth), formerly used for OCR studies. There are several versions of the set (concerning resolution and binarization). Noised and denoised sets (done by several methods) are eventually going to be uploaded.
☆12Updated 7 years ago
Alternatives and similar repositories for old-books-dataset
Users that are interested in old-books-dataset are comparing it to the libraries listed below
Sorting:
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 4 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 3 years ago
- ☆9Updated 5 years ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆27Updated 6 years ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Updated 3 years ago
- Text and Layout Document Image Understanding. LayoutLM☆23Updated 3 years ago
- A Dense Text Detection model using Receptive Field Blocks☆31Updated 2 years ago
- DFKI Layout Detection for OCR-D☆47Updated last month
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆32Updated 2 years ago
- TextTron is a simple light-weight image processing based text detector for document images.☆52Updated 4 years ago
- Pretrained mixed models to be used with Calamari.☆63Updated 8 months ago
- A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers☆57Updated 9 months ago
- CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images☆132Updated 5 months ago
- CVPR 2022: Table Structure Recognition☆40Updated 3 years ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆80Updated 2 years ago
- Convolutional recurrent neural network for scene text recognition or OCR in Keras☆125Updated 4 years ago
- A Large Dataset of Historical Japanese Documents with Complex Layouts☆34Updated 2 years ago
- ☆15Updated 5 years ago
- ☆23Updated 2 years ago
- OCR & Ground Truth Resources☆76Updated 3 years ago
- ☆3Updated 2 weeks ago
- Key Information Extraction From Documents: Evaluation And Generator☆20Updated 4 years ago
- ☆25Updated 5 years ago
- ☆69Updated 7 years ago
- TensorFlow Implementation of FOTS, Fast Oriented Text Spotting with a Unified Network.☆56Updated 4 years ago
- Handwritten text recognition using transformers.☆158Updated 11 months ago
- document image degradation☆163Updated 5 years ago
- Form images from U.S. National Archives annotated with text bounding boxes, classes, relationships, and transcription.☆38Updated 3 years ago
- Detect textlines in document images☆93Updated last year
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆42Updated last year