PedroBarcha / old-books-dataset
Old book pages (with ground truth), formerly used for OCR studies. The set comes in several versions that differ in resolution and binarization. Noised and denoised sets (produced by several methods) will eventually be uploaded.
☆12 Updated 7 years ago
Alternatives and similar repositories for old-books-dataset:
Users who are interested in old-books-dataset compare it to the repositories listed below:
- ☆9 Updated 5 years ago
- ☆21 Updated 2 years ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction ☆27 Updated 5 years ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR ☆15 Updated 3 years ago
- TensorFlow implementation of a segmentation system for document images. ☆34 Updated 6 years ago
- A Dense Text Detection model using Receptive Field Blocks ☆31 Updated 2 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing" ☆33 Updated 3 years ago
- OCR & Ground Truth Resources ☆75 Updated 2 years ago
- ☆25 Updated 4 years ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION ☆79 Updated last year
- Quicksign OCRized Text Dataset (QS-OCR) ☆44 Updated 5 years ago
- A U-Net-based deep learning model to remove line/box/spurious artifacts from text images. Unsupervised training. ☆58 Updated 5 years ago
- ☆15 Updated 4 years ago
- ☆127 Updated 4 years ago
- Close-Domain fine-tuning for table detection ☆72 Updated 2 years ago
- Pytorch implementation of our paper: Adapting OCR with Limited Labels ☆60 Updated last year
- DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation ☆120 Updated last year
- DFKI Layout Detection for OCR-D ☆47 Updated this week
- ☆69 Updated 7 years ago
- Document Image Binarization ☆78 Updated 5 months ago
- document image degradation ☆163 Updated 4 years ago
- ☆77 Updated 2 years ago
- Handwritten text recognition using transformers. ☆157 Updated 8 months ago
- Convolutional recurrent neural network for scene text recognition or OCR in Keras ☆125 Updated 3 years ago
- Document Visual Question Answering ☆115 Updated 4 years ago
- A list of open datasets for OCR. ☆100 Updated 7 years ago
- OCR-D-compliant page segmentation ☆67 Updated 3 weeks ago
- A curated list of papers and resources for scene text detection and recognition ☆47 Updated 5 years ago
- Key Information Extraction From Documents: Evaluation And Generator ☆20 Updated 4 years ago
- Implementation of BertGrid: https://arxiv.org/abs/1909.04948 ☆30 Updated 11 months ago