HOCR manipulation and utility library; provides hocr2pdf binary.
☆14Mar 5, 2018Updated 7 years ago
Alternatives and similar repositories for python-hocr
Users that are interested in python-hocr are comparing it to the libraries listed below
Sorting:
- HOCR Specification Python Parser☆12Sep 23, 2015Updated 10 years ago
- QA-tool for scans with corresponding ALTO-files☆26Dec 2, 2022Updated 3 years ago
- ☆10Jan 22, 2023Updated 3 years ago
- Crop And Splice Segments (of scanned pages)☆14Mar 11, 2019Updated 6 years ago
- Some bits of javascript to transcribe scanned pages using PageXML☆17Mar 18, 2024Updated last year
- A MongoDB implementation of the W3C Web Annotation Protocol☆18Jun 3, 2022Updated 3 years ago
- IIIF Examples and useful code☆20Sep 10, 2025Updated 5 months ago
- Development version of ndlstm, multidimensional LSTMs for TensorFlow☆19Feb 20, 2018Updated 8 years ago
- JS for overlaying OCR on image using HOCR formatted HTML☆26Jul 30, 2016Updated 9 years ago
- Specifications for the DTS API☆33Feb 16, 2026Updated last week
- Simple app for visual editing of Page XML files☆31Sep 25, 2025Updated 5 months ago
- Kitodo.Presentation is a feature-rich framework for building a METS- or IIIF-based digital library. It is part of the Kitodo Digital Libr…☆43Updated this week
- ☆10Apr 20, 2019Updated 6 years ago
- A place for design artifacts, stories, and feedback pertaining to Mirador ecosystem tools (especially Mirador 3).☆10Apr 4, 2019Updated 6 years ago
- The Linked Data Theatre is a platform for an optimal presentation of Linked Data☆38Feb 3, 2024Updated 2 years ago
- DEPRECATED eXist code for Syriaca.org: The Syriac Reference Portal☆10Jun 1, 2024Updated last year
- Tamil Language words list☆12Jul 2, 2016Updated 9 years ago
- Project to digitize avant-garde periodicals☆12May 13, 2022Updated 3 years ago
- An awesome list for Mirador's projects and plugins.☆45Feb 11, 2026Updated 2 weeks ago
- Tools for normalizing the use of some characters and checking file consistencies☆11Jan 12, 2026Updated last month
- (UNMAINTAINED) Tests for Linked Data Platform (LDP)☆23Jan 23, 2024Updated 2 years ago
- Snapshots of the GRETIL repository of South Asian (Sanskrit, Pali, etc.) etexts☆10Updated this week
- A modern rendering engine for the web.☆12Feb 10, 2026Updated 2 weeks ago
- a little nodejs server and script that extracts letters from images via tesseract☆19Mar 4, 2015Updated 10 years ago
- XML builder macro letting you write XML inside Rust code☆10Nov 15, 2023Updated 2 years ago
- Utilities for working with RDF/Linked Data in JavaScript / TypeScript☆10Sep 12, 2022Updated 3 years ago
- Mongoose plugins search site☆19Mar 24, 2023Updated 2 years ago
- ☆11Aug 8, 2016Updated 9 years ago
- Regularized latent variable mixed membership modeling☆13Aug 12, 2013Updated 12 years ago
- JSON event parser is a simple streaming JSON parser and serializer implementation in Rust.☆14Feb 6, 2026Updated 3 weeks ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 5 years ago
- ☆10Nov 2, 2016Updated 9 years ago
- Create zim packages out of regular websites☆10Jun 21, 2016Updated 9 years ago
- A reliable diacritics database with their associated ASCII characters☆13May 3, 2020Updated 5 years ago
- Abstract File Storage Connectors☆13Feb 10, 2026Updated 2 weeks ago
- Format specifiers to use with json-schema-validator☆12Jul 1, 2013Updated 12 years ago
- Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow.☆12Dec 24, 2016Updated 9 years ago
- JournalTouch provides a touch-optimized interface for browsing current journal tables of contents in Responsive Design. Fun!☆14May 27, 2019Updated 6 years ago
- ☆13Updated this week