PRImA-Research-Lab / prima-page-converter
Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as well as ALTO XML, FineReader XML, and HOCR
☆23Updated 4 years ago
Alternatives and similar repositories for prima-page-converter:
Users that are interested in prima-page-converter are comparing it to the libraries listed below
- Some bits of javascript to transcribe scanned pages using PageXML☆17Updated last year
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Updated 4 years ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35Updated last year
- Converters for various file formats used for representing OCR☆12Updated 11 months ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆14Updated 2 weeks ago
- Training files for Greek cursive script (in early print)☆14Updated 3 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆55Updated 8 months ago
- Python tools for performing various operations on ALTO XML files☆45Updated 3 weeks ago
- OCR-D wrapper for prima-pagetopdf☆9Updated last week
- Conversions between various OCR formats☆74Updated last year
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Updated 3 years ago
- Named entity annotation tool☆27Updated last year
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Updated last year
- Check your modified Ground Truth files with visual support!☆10Updated last year
- ☆10Updated 2 years ago
- OCRopus model for Gothic print (Fraktur)☆18Updated 5 years ago
- Augment line images for improving OCR datasets☆9Updated last year
- ☆31Updated 2 months ago
- ☆14Updated 2 years ago
- An extensible viewer for OCR-D mets.xml files☆20Updated 9 months ago
- ☆12Updated 2 years ago
- You Actually Look Twice At it☆31Updated 2 months ago
- The CIS OCR PostCorrectionTool☆41Updated 2 years ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Updated last year
- ☆58Updated last month
- A Pythonic API and some command line tools to access the Transkribus server via its REST API☆27Updated 2 years ago
- OCR-D python tools☆33Updated 7 months ago