PRImA-Research-Lab / prima-page-converterLinks
Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as well as ALTO XML, FineReader XML, and HOCR
☆24Updated 4 years ago
Alternatives and similar repositories for prima-page-converter
Users that are interested in prima-page-converter are comparing it to the libraries listed below
Sorting:
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35Updated 2 years ago
- Some bits of javascript to transcribe scanned pages using PageXML☆17Updated last year
- OCR-D wrapper for prima-pagetopdf☆9Updated last week
- Python tools for performing various operations on ALTO XML files☆47Updated 3 months ago
- Converters for various file formats used for representing OCR☆12Updated last month
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆13Updated 10 months ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆55Updated last week
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Updated 5 years ago
- Augment line images for improving OCR datasets☆9Updated last year
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆14Updated 3 weeks ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Updated last year
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Updated 11 months ago
- An extensible viewer for OCR-D mets.xml files☆21Updated last year
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Updated 3 years ago
- Named entity annotation tool☆28Updated last year
- ☆13Updated 2 years ago
- Training files for Greek cursive script (in early print)☆14Updated 4 years ago
- Check your modified Ground Truth files with visual support!☆10Updated last year
- Conversions between various OCR formats☆77Updated 2 years ago
- Layout analysis to find layout elements in documents (similar to P2PaLA)☆19Updated last week
- Master repository which includes most other OCR-D repositories as submodules☆73Updated 2 weeks ago
- Tools for normalizing the use of some characters and checking file consistencies☆11Updated 4 months ago
- OCR-D python tools☆33Updated 9 months ago
- ☆31Updated last month
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support☆57Updated 4 years ago
- OCRopus model for Gothic print (Fraktur)☆18Updated 5 years ago
- You Actually Look Twice At it☆35Updated 4 months ago
- A repository for online OCRD training infrastructure.☆13Updated 4 years ago
- The CIS OCR PostCorrectionTool☆42Updated 2 years ago
- ☆11Updated 4 years ago