Augment line images for improving OCR datasets
☆10Oct 4, 2023Updated 2 years ago
Alternatives and similar repositories for LineAug
Users that are interested in LineAug are comparing it to the libraries listed below
Sorting:
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- OCR-D post-correction with encoder-attention-decoder LSTMs☆13May 1, 2025Updated 9 months ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Jul 2, 2021Updated 4 years ago
- Some bits of javascript to transcribe scanned pages using PageXML☆17Mar 18, 2024Updated last year
- OCR-D python tools☆33Aug 16, 2024Updated last year
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911)☆18Oct 31, 2025Updated 3 months ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- ☆10Mar 16, 2023Updated 2 years ago
- Docker container for ocropus3 OCR system☆12Aug 19, 2018Updated 7 years ago
- DFKI Layout Detection for OCR-D☆47May 1, 2025Updated 9 months ago
- Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as wel…☆24Jan 30, 2021Updated 5 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Dec 17, 2021Updated 4 years ago
- Wrapper for the kraken OCR engine☆12Jul 12, 2025Updated 7 months ago
- Polytonic Greek OCR tool suite based on Ocropus 0.7☆13Jul 5, 2023Updated 2 years ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Jan 13, 2024Updated 2 years ago
- Stand-alone implementation of UCD's IIIF image re-formatting tool + plugin to integrate with Mirador IIIF-compliant image viewer☆18Jul 31, 2017Updated 8 years ago
- ☆20Aug 18, 2019Updated 6 years ago
- An Editor for creating simple or complex OCR workflows☆17Jun 13, 2024Updated last year
- TensorFlow implementation of a segmentation system for document images.☆35Sep 9, 2018Updated 7 years ago
- OCR-D wrapper for detectron2 based segmentation models☆17May 1, 2025Updated 9 months ago
- Double-checked Gold Standard Data for Training and Testing OCR Engines☆21Dec 31, 2022Updated 3 years ago
- Development version of ndlstm, multidimensional LSTMs for TensorFlow☆19Feb 20, 2018Updated 8 years ago
- An extensible viewer for OCR-D mets.xml files☆22May 30, 2024Updated last year
- Process, enhance and evaluate multiple OCR output.☆24Dec 2, 2025Updated 2 months ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆194Nov 16, 2025Updated 3 months ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆22Feb 21, 2018Updated 8 years ago
- Document Understanding tools☆21Dec 22, 2021Updated 4 years ago
- ☆17Sep 25, 2021Updated 4 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Jul 18, 2019Updated 6 years ago
- ☆12Jan 7, 2023Updated 3 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆199May 21, 2025Updated 9 months ago
- An OCR evaluation tool☆69Aug 22, 2025Updated 6 months ago
- OCR-D-compliant page segmentation☆68Nov 19, 2025Updated 3 months ago
- RObust document image BINarization☆184Aug 2, 2024Updated last year
- Simple app for visual editing of Page XML files☆31Sep 25, 2025Updated 5 months ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Nov 3, 2017Updated 8 years ago
- A tutorial on the PyTorch-based ocropus components.☆73Apr 18, 2020Updated 5 years ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 3 months ago