PedroBarcha / old-books-dataset
Old book pages (with ground truth), formerly used for OCR studies. There are several versions of the set, differing in resolution and binarization. Noised and denoised sets (produced by several methods) will eventually be uploaded.
☆12 Updated 7 years ago
Alternatives and similar repositories for old-books-dataset
Users who are interested in old-books-dataset are comparing it to the repositories listed below.
- ☆9 Updated 5 years ago
- Repo to host the forms dataset ☆15 Updated 4 years ago
- ☆33 Updated 4 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden… ☆26 Updated 4 years ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context) ☆21 Updated 3 years ago
- ☆17 Updated 10 months ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR ☆15 Updated 3 years ago
- Pytorch implementation of our paper: Adapting OCR with Limited Labels ☆61 Updated last year
- ☆23 Updated 7 months ago
- ☆22 Updated 4 years ago
- OCR & Ground Truth Resources ☆75 Updated 3 years ago
- Sample implementation of OCR metrics (CER, WER) calculation with TesseractOCR and fastwer ☆29 Updated 3 years ago
- OCR-D-compliant page segmentation ☆67 Updated 3 weeks ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing" ☆33 Updated 3 years ago
- Handwritten text recognition using transformers. ☆158 Updated 10 months ago
- ☆23 Updated 2 years ago
- ☆17 Updated 3 years ago
- ☆138 Updated last year
- DFKI Layout Detection for OCR-D ☆47 Updated last month
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction ☆27 Updated 6 years ago
- TensorFlow implementation of a segmentation system for document images. ☆34 Updated 6 years ago
- Improving Document Binarization via Adversarial Noise-Texture Augmentation (ICIP 2019) ☆38 Updated 6 years ago
- Convolutional recurrent neural network for scene text recognition or OCR in Keras ☆125 Updated 4 years ago
- DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation ☆120 Updated last year
- Offline Handwritten Text Recognition ☆45 Updated 5 years ago
- A list of open datasets for OCR. ☆101 Updated 7 years ago
- A Dense Text Detection model using Receptive Field Blocks ☆31 Updated 2 years ago
- Key Information Extraction From Documents: Evaluation And Generator ☆20 Updated 4 years ago
- Detect textlines in document images ☆93 Updated last year
- CVPR 2022: Table Structure Recognition ☆40 Updated 3 years ago