jbrinley / HocrConverterLinks
Create PDFs and plain text from hOCR documents
☆35Updated 15 years ago
Alternatives and similar repositories for HocrConverter
Users that are interested in HocrConverter are comparing it to the libraries listed below
Sorting:
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆59Updated 3 months ago
- The CIS OCR PostCorrectionTool☆44Updated 3 years ago
- Conversions between various OCR formats☆82Updated 2 years ago
- EFES (EpiDoc Front End Services) is a custom and readily customizable platform for publication and search/indexing of EpiDoc files, based…☆33Updated 11 months ago
- Python tools for performing various operations on ALTO XML files☆48Updated 10 months ago
- TIFY is a slim and mobile-friendly IIIF document viewer.☆122Updated 3 weeks ago
- Specifications for the DTS API☆32Updated last month
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆198Updated 7 months ago
- IIIF Presentation API implementation in Python☆34Updated last year
- Documentation and use cases for ALTO XML☆41Updated 7 years ago
- An awesome list for Mirador's projects and plugins.☆45Updated last year
- IIIF Examples and useful code☆19Updated 4 months ago
- Text Overlay plugin for Mirador 3☆61Updated last month
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35Updated 2 years ago
- Library to parse and create METS files, especially for Archivematica.☆23Updated this week
- Web service for creating and hosting IIIF manifests from METS/MODS documents☆36Updated 3 years ago
- Stand-alone implementation of UCD's IIIF image re-formatting tool + plugin to integrate with Mirador IIIF-compliant image viewer☆18Updated 8 years ago
- ALTO XML schema - latest and all former versions☆55Updated last month
- QA-tool for scans with corresponding ALTO-files☆26Updated 3 years ago
- Training files for Greek cursive script (in early print)☆15Updated 4 years ago
- Simple command line oai-pmh harvester written in Python.☆41Updated 3 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆14Updated 8 months ago
- Process, enhance and evaluate multiple OCR output.☆24Updated last month
- Note: the repo has been moved to https://gitlab.com/readcoop/Transkribus/TranskribusCore☆37Updated 5 years ago
- Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)☆17Updated 3 months ago
- Simple IIIF Search service for OCRed texts☆17Updated 5 years ago
- Rails application supporting the creation of OCR and the IIIF Content Search API☆34Updated 3 years ago
- process MARC records from Python☆257Updated 5 years ago
- python library for working with IIIF Image and Presentation APIs☆21Updated 2 months ago
- The hOCR Embedded OCR Workflow and Output Format☆75Updated last year