internetarchive / archive-hocr-tools
Efficient hOCR tooling
☆50 Updated 3 months ago
Alternatives and similar repositories for archive-hocr-tools
Users who are interested in archive-hocr-tools are comparing it to the libraries listed below.
- Documentation and use cases for ALTO XML ☆41 Updated 7 years ago
- Conversions between various OCR formats ☆81 Updated 2 years ago
- Process, enhance and evaluate multiple OCR output. ☆24 Updated last year
- Python tools for performing various operations on ALTO XML files ☆48 Updated 8 months ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets ☆57 Updated last month
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆196 Updated 6 months ago
- Command-line tile downloader/assembler for IIIF endpoints/manifests ☆35 Updated 4 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler ☆15 Updated 4 years ago
- The hOCR Embedded OCR Workflow and Output Format ☆75 Updated last year
- Create PDFs from IIIF manifests, completely client-side (with server-based fallback for unsupported browsers) ☆46 Updated last month
- tesseractXplore: an easy-to-use Tesseract GUI with full control ☆24 Updated 4 years ago
- Web application for transcribing OCR ground truth from Archive.org ☆17 Updated 7 years ago
- View HOCR files with Mirador