PRImA-Research-Lab / prima-page-converterLinks
Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as well as ALTO XML, FineReader XML, and HOCR
☆24Updated 4 years ago
Alternatives and similar repositories for prima-page-converter
Users that are interested in prima-page-converter are comparing it to the libraries listed below
Sorting:
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Updated 5 years ago
- Converters for various file formats used for representing OCR☆12Updated 6 months ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆57Updated last month
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35Updated 2 years ago
- Some bits of javascript to transcribe scanned pages using PageXML☆17Updated last year
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆14Updated 6 months ago
- Check your modified Ground Truth files with visual support!☆10Updated last year
- Conversions between various OCR formats☆81Updated 2 years ago
- ☆32Updated 2 months ago
- guides and test data for OCR4all☆32Updated 3 years ago
- Python tools for performing various operations on ALTO XML files☆48Updated 8 months ago
- Training files for Greek cursive script (in early print)☆15Updated 4 years ago
- A repository for online OCRD training infrastructure.☆13Updated 5 years ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Updated 2 weeks ago
- Documentation and use cases for ALTO XML☆41Updated 7 years ago
- Named entity annotation tool☆28Updated 2 years ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Updated last year
- Text Overlay plugin for Mirador 3☆59Updated 3 weeks ago
- ☆61Updated last week
- Simple app for visual editing of Page XML files☆31Updated last month
- A Pythonic API and some command line tools to access the Transkribus server via its REST API☆28Updated 2 years ago
- An extensible viewer for OCR-D mets.xml files☆22Updated last year
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Updated 3 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Updated last year
- OCR-D python tools☆33Updated last year
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Updated 4 years ago
- Augment line images for improving OCR datasets☆10Updated 2 years ago
- An OCR evaluation tool☆68Updated 2 months ago
- PAGE XML format collection for document image page content and more☆68Updated 4 years ago
- Web application for transcribing OCR ground truth from Archive.org☆17Updated 7 years ago