PRImA-Research-Lab / prima-page-converter
Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as well as ALTO XML, FineReader XML, and HOCR.
☆24 · Updated 4 years ago
Alternatives and similar repositories for prima-page-converter
Users who are interested in prima-page-converter are comparing it to the repositories listed below.
- Converters for various file formats used for representing OCR ☆12 · Updated 4 months ago
- Java command line tool to convert PAGE XML files with layout and text content to PDF ☆10 · Updated 5 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets ☆55 · Updated 3 months ago
- Some bits of javascript to transcribe scanned pages using PageXML ☆17 · Updated last year
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR. ☆35 · Updated 2 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2) ☆14 · Updated 4 months ago
- guides and test data for OCR4all ☆32 · Updated 2 years ago
- A repository for online OCRD training infrastructure. ☆13 · Updated 5 years ago
- Conversions between various OCR formats ☆80 · Updated 2 years ago
- Check your modified Ground Truth files with visual support! ☆10 · Updated last year
- Python tools for performing various operations on ALTO XML files ☆48 · Updated 6 months ago
- Convert Transkribus PAGE-XML to standard PAGE-XML ☆12 · Updated last year
- Transcriptions of primers (19th century) ☆11 · Updated last year
- Training files for Greek cursive script (in early print) ☆15 · Updated 4 years ago
- Named entity annotation tool ☆28 · Updated 2 years ago
- Simple app for visual editing of Page XML files ☆31 · Updated 4 months ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg ☆13 · Updated last year
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen" ☆12 · Updated 3 years ago
- Text Overlay plugin for Mirador 3 ☆57 · Updated last month
- An extensible viewer for OCR-D mets.xml files ☆21 · Updated last year
- Highlighting various OCR formats directly in Solr ☆86 · Updated last week
- Manuals, lexica, OCR test data for PoCoTo and the profiler ☆15 · Updated 4 years ago
- OCRopus model for Gothic print (Fraktur) ☆18 · Updated 5 years ago
- The CIS OCR PostCorrectionTool ☆43 · Updated 2 years ago
- OCR-D python tools ☆33 · Updated last year
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆196 · Updated 3 months ago