jbrinley / HocrConverterLinks
Create PDFs and plain text from hOCR documents
☆34Updated 14 years ago
Alternatives and similar repositories for HocrConverter
Users that are interested in HocrConverter are comparing it to the libraries listed below
Sorting:
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆188Updated 2 weeks ago
- The CIS OCR PostCorrectionTool☆42Updated 2 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆55Updated 2 weeks ago
- Library to parse and create METS files, especially for Archivematica.☆21Updated 2 weeks ago
- ALTO XML schema - latest and all former versions☆52Updated 10 months ago
- Python module for easing the construction of JSON manifests compliant with IIIF API 3.0.☆20Updated 4 months ago
- Python tools for performing various operations on ALTO XML files☆47Updated 3 months ago
- Simple IIIF Search service for OCRed texts☆16Updated 4 years ago
- Web service for creating and hosting IIIF manifests from METS/MODS documents☆36Updated 2 years ago
- IIIF Examples and useful code☆19Updated this week
- QA-tool for scans with corresponding ALTO-files☆24Updated 2 years ago
- Documentation and use cases for ALTO XML☆41Updated 6 years ago
- Rails application supporting the creation of OCR and the IIIF Content Search API☆34Updated 2 years ago
- Efficient hOCR tooling☆44Updated last month
- View HOCR files with Mirador☆29Updated 7 years ago
- extract text from ALTO file☆9Updated last year
- IIIF Presentation API implementation in Python☆36Updated last year
- Conversions between various OCR formats☆78Updated 2 years ago
- Note: the repo has been moved to https://gitlab.com/readcoop/Transkribus/TranskribusSwtGui☆18Updated 4 years ago
- A Python script which generates YAML files intended to accompany HathiTrust submissions. Includes documentation about types of data reque…☆9Updated 8 years ago
- Command-line client for the DataCite Metadata Store (MDS)☆17Updated 4 years ago
- ☆56Updated this week
- An awesome list for Mirador's projects and plugins.☆44Updated last year
- python library for working with IIIF Image and Presentation APIs☆20Updated 4 months ago
- ☆44Updated this week
- Stand-alone implementation of UCD's IIIF image re-formatting tool + plugin to integrate with Mirador IIIF-compliant image viewer☆16Updated 7 years ago
- Komplexní validátor☆8Updated 3 weeks ago
- The hOCR Embedded OCR Workflow and Output Format☆74Updated 9 months ago
- Command-line tools to transform TEI & METS files to IIIF Presentation API manifests☆19Updated 8 years ago
- Request and workflow management for archives and special collections☆17Updated 4 years ago