Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as well as ALTO XML, FineReader XML, and HOCR
☆24Jan 30, 2021Updated 5 years ago
Alternatives and similar repositories for prima-page-converter
Users that are interested in prima-page-converter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Converters for various file formats used for representing OCR☆12Apr 30, 2025Updated 10 months ago
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Apr 27, 2020Updated 5 years ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35May 25, 2023Updated 2 years ago
- Core libraries by the PRImA Research Lab☆16Jul 30, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆15Jan 20, 2026Updated 2 months ago
- ☆10Aug 5, 2019Updated 6 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆59Sep 25, 2025Updated 6 months ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Dec 17, 2021Updated 4 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Jul 2, 2021Updated 4 years ago
- A repository for online OCRD training infrastructure.☆13Aug 20, 2020Updated 5 years ago
- Docker container for ocropus3 OCR system☆12Aug 19, 2018Updated 7 years ago
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- Simple app for visual editing of Page XML files☆31Sep 25, 2025Updated 6 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- An OCR evaluation tool☆69Aug 22, 2025Updated 7 months ago
- Documentation and use cases for ALTO XML☆42Sep 10, 2018Updated 7 years ago
- Augment line images for improving OCR datasets☆10Oct 4, 2023Updated 2 years ago
- ☆17Sep 25, 2021Updated 4 years ago
- Named Entity Recognition tool for Europeana Newspapers☆14Apr 5, 2018Updated 7 years ago
- ☆14Sep 12, 2019Updated 6 years ago
- Earley based parsing tools for XSLT☆10Oct 8, 2020Updated 5 years ago
- An invisible-XML processor for XQuery and XSLT☆14Jun 11, 2024Updated last year
- QA-tool for scans with corresponding ALTO-files☆26Dec 2, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc. The final training process for lat.traineddata☆13Jan 13, 2016Updated 10 years ago
- Browser based post correction tool for Alto XML files☆14Sep 20, 2013Updated 12 years ago
- Catalog of functional programming idioms (in XQuery 3.0)☆16Aug 4, 2016Updated 9 years ago
- CoffeePot releases and website pages, see nineml/nineml☆13Dec 26, 2025Updated 3 months ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 4 months ago
- TensorFlow implementation of a segmentation system for document images.☆35Sep 9, 2018Updated 7 years ago
- Obsolete repo, merged into eynollah☆12Sep 29, 2025Updated 5 months ago
- XSLT Functions for Transpect☆13Mar 16, 2026Updated last week
- The CIS OCR PostCorrectionTool☆44Nov 7, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- OCR-D python tools☆33Aug 16, 2024Updated last year
- Useful XProc scripts☆13Oct 31, 2017Updated 8 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆196Updated this week
- ☆22Dec 6, 2018Updated 7 years ago
- Library to parse and create METS files, especially for Archivematica.☆23Feb 3, 2026Updated last month
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support☆60Apr 16, 2021Updated 4 years ago
- XPath/XQuery extension function library for the Saxon XSLT processor☆13Dec 14, 2022Updated 3 years ago