PedroBarcha / old-books-dataset
Old book pages (with groundtruth), formerly used for OCR studies. There are several versions of the set (concerning resolution and binarization). Noised and denoised sets (done by several methods) are eventually going to be uploaded.
☆12Updated 7 years ago
Alternatives and similar repositories for old-books-dataset:
Users that are interested in old-books-dataset are comparing it to the libraries listed below
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆28Updated 5 years ago
- Key Information Extraction From Documents: Evaluation And Generator☆20Updated 3 years ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆21Updated 3 years ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Updated 3 years ago
- TensorFlow Implementation of FOTS, Fast Oriented Text Spotting with a Unified Network.☆56Updated 4 years ago
- Pytorch implementation of our paper: Adapting OCR with Limited Labels☆59Updated last year
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 2 years ago
- ☆136Updated 11 months ago
- Using FCN to segment the book's content and background, then dewarping the pages,☆19Updated 3 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 4 years ago
- DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation☆116Updated last year
- PyThreshold is a python package featuring Numpy/Scipy implementations of state-of-the-art image thresholding algorithms.☆58Updated 2 years ago
- ☆17Updated 2 years ago
- Document Visual Question Answering☆114Updated 4 years ago
- ☆69Updated 6 years ago
- ☆21Updated 2 years ago
- Detect textlines in document images☆91Updated 8 months ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆32Updated 2 years ago
- Handwritten text recognition using transformers.☆155Updated 6 months ago
- ☆25Updated 4 years ago
- OCR-D-compliant page segmentation☆67Updated last week
- DFKI Layout Detection for OCR-D☆47Updated 3 months ago
- ☆23Updated 4 months ago
- An application of high resolution GANs to dewarp images of perturbed documents☆133Updated 3 years ago
- ☆33Updated 4 years ago
- CVPR 2022: Table Structure Recognition☆39Updated 2 years ago
- Official repository accompaying the ICDAR 2023 paper☆11Updated last year
- ☆127Updated 4 years ago
- Pytorch Implementation of Chargrid Paper (https://arxiv.org/abs/1809.08799)☆27Updated 2 years ago