QA-tool for scans with corresponding ALTO-files
☆26 · Dec 2, 2022 · Updated 3 years ago
Alternatives and similar repositories for quack
Users that are interested in quack are comparing it to the libraries listed below.
- Documentation and use cases for ALTO XML ☆42 · Sep 10, 2018 · Updated 7 years ago
- Python tools for performing various operations on ALTO XML files ☆49 · Feb 27, 2025 · Updated last year
- Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as wel… ☆24 · Jan 30, 2021 · Updated 5 years ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR. ☆35 · May 25, 2023 · Updated 2 years ago
- An awesome list for Mirador's projects and plugins. ☆45 · Feb 11, 2026 · Updated last month
- Rails engine for working with storage of OpenAnnotations stored in Fedora4 ☆13 · Aug 4, 2016 · Updated 9 years ago
- HOCR manipulation and utility library; provides hocr2pdf binary. ☆14 · Mar 5, 2018 · Updated 8 years ago
- code to remove "noise" from hOCR output of Tesseract OCR. ☆14 · Oct 24, 2016 · Updated 9 years ago
- Browser based post correction tool for Alto XML files ☆14 · Sep 20, 2013 · Updated 12 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets ☆59 · Sep 25, 2025 · Updated 5 months ago
- Crop And Splice Segments (of scanned pages) ☆14 · Mar 11, 2019 · Updated 7 years ago
- Polytonic Greek OCR tool suite based on Ocropus 0.7 ☆13 · Jul 5, 2023 · Updated 2 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler ☆15 · Jul 2, 2021 · Updated 4 years ago
- interactive, customizable semantic web visualization ☆15 · Dec 27, 2025 · Updated 2 months ago
- TEI Publisher Extension for Visual Studio Code ☆17 · Jan 22, 2026 · Updated 2 months ago
- ALTO XML schema - latest and all former versions ☆55 · Jan 20, 2026 · Updated 2 months ago
- Command line tool for linking civil registries ☆14 · Feb 13, 2026 · Updated last month
- JS for overlaying OCR on image using HOCR formatted HTML ☆26 · Jul 30, 2016 · Updated 9 years ago
- Project to digitize avant-garde periodicals ☆12 · May 13, 2022 · Updated 3 years ago
- View HOCR files with Mirador ☆29 · Sep 27, 2017 · Updated 8 years ago
- Web application for transcribing OCR ground truth from Archive.org ☆17 · Feb 22, 2018 · Updated 8 years ago
- Harvard University Library Cloud API ☆11 · Feb 25, 2022 · Updated 4 years ago
- This repository contains simple code in Python to help historians prepare data for quantitative analysis & visualization. Visit the follo… ☆27 · Nov 11, 2025 · Updated 4 months ago
- IIIF Examples and useful code ☆20 · Sep 10, 2025 · Updated 6 months ago
- Convert Transkribus PAGE-XML to standard PAGE-XML ☆12 · Dec 10, 2025 · Updated 3 months ago
- Tools for TICCL ☆14 · Dec 12, 2025 · Updated 3 months ago
- DEPRECATED eXist code for Syriaca.org: The Syriac Reference Portal ☆10 · Jun 1, 2024 · Updated last year
- Scripts, data and results for TEI Hackathon ☆12 · Oct 31, 2015 · Updated 10 years ago
- Automatic alignment of books between HathiTrust, Internet Archive, Google Books, etc. ☆37 · Feb 20, 2026 · Updated last month
- Documentation and Materials for the Zotero Workshops. Written in Pandoc enhanced Markdown. ☆24 · Nov 6, 2017 · Updated 8 years ago
- This is a side project from 2008. This package contains a tool for automatically cropping and deskewing images of book pages captured by … ☆28 · Apr 25, 2013 · Updated 12 years ago
- Named Entity Recognition API used by TEI Publisher ☆21 · May 21, 2024 · Updated last year
- gathering point for open source OCR scripts and diffs ☆43 · Jun 27, 2014 · Updated 11 years ago
- A static site generator for TEI Publisher ☆13 · Mar 8, 2022 · Updated 4 years ago
- Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA) ☆23 · Feb 11, 2022 · Updated 4 years ago
- Text Overlay plugin for Mirador 3 ☆61 · Feb 14, 2026 · Updated last month
- Using social media to steer web archiving and curation. ☆18 · Nov 20, 2015 · Updated 10 years ago
- ☆23 · Nov 10, 2017 · Updated 8 years ago
- XML:DB Initiative for XML Databases ☆18 · Updated this week