PedroBarcha / old-books-datasetLinks
Old book pages (with groundtruth), formerly used for OCR studies. There are several versions of the set (concerning resolution and binarization). Noised and denoised sets (done by several methods) are eventually going to be uploaded.
☆12Updated 7 years ago
Alternatives and similar repositories for old-books-dataset
Users that are interested in old-books-dataset are comparing it to the libraries listed below
Sorting:
- A Dense Text Detection model using Receptive Field Blocks☆31Updated 2 years ago
- Text and Layout Document Image Understanding. LayoutLM☆23Updated 3 years ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆27Updated 6 years ago
- ☆15Updated 5 years ago
- ☆9Updated 5 years ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Updated 3 years ago
- ☆23Updated 2 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 4 years ago
- Pretrained mixed models to be used with Calamari.☆63Updated 9 months ago
- ☆69Updated 7 years ago
- OCR-D-compliant page segmentation☆67Updated last month
- Detect textlines in document images☆93Updated last year
- ☆25Updated 5 years ago
- CVPR 2022: Table Structure Recognition☆40Updated 3 years ago
- TextTron is a simple light-weight image processing based text detector for document images.☆52Updated 4 years ago
- DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation☆121Updated last year
- Handwritten text recognition using transformers.☆158Updated 11 months ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 3 years ago
- Python package for Stroke Width Transform - Localizing the Text (Letters & Words) in a Natural Image☆38Updated last year
- ☆127Updated 5 years ago
- TensorFlow Implementation of FOTS, Fast Oriented Text Spotting with a Unified Network.☆56Updated 4 years ago
- A Unet based deeplearning model to line/box/spurious artifacts from text images. Unsupervised training.☆59Updated 5 years ago
- ☆22Updated 4 years ago
- DFKI Layout Detection for OCR-D☆47Updated 2 months ago
- code and data for paper "One-shot Text Field Labeling using Attention and BeliefPropagation for Structure Information Extraction"☆61Updated 4 years ago
- Evaluation of the Layoutlm model on the CORD dataset☆32Updated 3 years ago
- Key Information Extraction From Documents: Evaluation And Generator☆20Updated 4 years ago
- ☆33Updated 4 years ago
- Quicksign OCRized Text Dataset (QS-OCR)☆45Updated 6 years ago
- ☆17Updated 3 years ago