TEI4HTR / page2teiView external linksLinks
A repository for illustrating the transformation of a PAGE XML file into XML-TEI format, resulting from experimentations made for the LECTAUREP project.
☆16May 18, 2022Updated 3 years ago
Alternatives and similar repositories for page2tei
Users that are interested in page2tei are comparing it to the libraries listed below
Sorting:
- ☆32Aug 29, 2025Updated 5 months ago
- Some bits of javascript to transcribe scanned pages using PageXML☆17Mar 18, 2024Updated last year
- The School of Salamanca. Web Application☆15Feb 5, 2026Updated last week
- Named entity annotation tool☆28Jul 6, 2023Updated 2 years ago
- An extensible viewer for OCR-D mets.xml files☆22May 30, 2024Updated last year
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 2 months ago
- Annotation tool (NER) for XML documents (TEI, EAD) - WIP☆11Jul 22, 2022Updated 3 years ago
- An awesome list for Mirador's projects and plugins.☆45Feb 14, 2024Updated 2 years ago
- Ground Truth Resources for the HTR of patrimonial documents☆47Feb 8, 2026Updated last week
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Tools for normalizing the use of some characters and checking file consistencies☆11Jan 12, 2026Updated last month
- Repository hosting the common code for the entity-fishing clients☆10Jun 10, 2025Updated 8 months ago
- Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis☆13Aug 21, 2025Updated 5 months ago
- Conversions between various OCR formats☆82May 13, 2023Updated 2 years ago
- Cours de python enseigné à l'École nationale des Chartes☆37Jul 6, 2021Updated 4 years ago
- Miscellaneous data analysis tools and scripts for the EHRI project☆15Jan 25, 2024Updated 2 years ago
- A repository for online OCRD training infrastructure.☆13Aug 20, 2020Updated 5 years ago
- ☆14Jul 11, 2022Updated 3 years ago
- Web service for creating and hosting IIIF manifests from METS/MODS documents☆36Dec 8, 2022Updated 3 years ago
- Obsolete repo, merged into eynollah☆12Sep 29, 2025Updated 4 months ago
- OCR-D wrapper for detectron2 based segmentation models☆17May 1, 2025Updated 9 months ago
- The main TEI Publisher app☆78Updated this week
- Double-checked Gold Standard Data for Training and Testing OCR Engines☆21Dec 31, 2022Updated 3 years ago
- Validator for the Presentation API☆47Updated this week
- SEM, a free NLP tool relying on machine learning technologies, especially CRFs.☆23Dec 1, 2021Updated 4 years ago
- A CLI tool that generates IIIF Presentation 2.1 Manifests from METS/MODS☆24Apr 17, 2025Updated 9 months ago
- Text collections made available by the CLiGS group.☆24Mar 22, 2022Updated 3 years ago
- A codebase to support a pure JSON search engine requiring no backend for any XHTML5 document collection☆63Feb 5, 2026Updated last week
- Java Domain Models for all IIIF APIs☆27Feb 1, 2026Updated 2 weeks ago
- Mannheim library utilities☆27Dec 29, 2025Updated last month
- An OCR evaluation tool☆69Aug 22, 2025Updated 5 months ago
- EFES (EpiDoc Front End Services) is a custom and readily customizable platform for publication and search/indexing of EpiDoc files, based…☆33Feb 5, 2026Updated last week
- ☆39Jun 6, 2024Updated last year
- TIFY is a slim and mobile-friendly IIIF document viewer.☆123Feb 3, 2026Updated last week
- Master repository which includes most other OCR-D repositories as submodules☆72Jul 4, 2025Updated 7 months ago
- You Actually Look Twice At it☆38Jan 21, 2025Updated last year
- Django web application to display, annotate, and export digitized books.☆33Feb 3, 2026Updated last week
- A crowdsourcing website reassembling the social network of early modern Britain☆34Nov 6, 2018Updated 7 years ago
- Page-wise text recognition with lower-supervision line data models☆51Updated this week