A repository illustrating the transformation of a PAGE XML file into XML-TEI format, based on experiments carried out for the LECTAUREP project.
☆17 · May 18, 2022 · Updated 3 years ago
Alternatives and similar repositories for page2tei
Users who are interested in page2tei are comparing it to the libraries listed below.
- Some bits of javascript to transcribe scanned pages using PageXML ☆17 · Mar 18, 2024 · Updated last year
- The School of Salamanca. Web Application ☆15 · Feb 16, 2026 · Updated 2 weeks ago
- Named entity annotation tool ☆28 · Jul 6, 2023 · Updated 2 years ago
- An extensible viewer for OCR-D mets.xml files ☆23 · May 30, 2024 · Updated last year
- Annotation tool (NER) for XML documents (TEI, EAD) - WIP ☆11 · Jul 22, 2022 · Updated 3 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML ☆12 · Dec 10, 2025 · Updated 2 months ago
- An awesome list for Mirador's projects and plugins. ☆45 · Feb 11, 2026 · Updated 3 weeks ago
- Ground Truth Resources for the HTR of patrimonial documents ☆47 · Mar 1, 2026 · Updated last week
- Check your modified Ground Truth files with visual support! ☆10 · Jan 31, 2024 · Updated 2 years ago
- Tools for normalizing the use of some characters and checking file consistencies ☆11 · Jan 12, 2026 · Updated last month
- Combination of the RapidFuzz library with Spacy PhraseMatcher ☆11 · Sep 29, 2021 · Updated 4 years ago
- Repository hosting the common code for the entity-fishing clients ☆10 · Jun 10, 2025 · Updated 8 months ago
- Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis ☆13 · Aug 21, 2025 · Updated 6 months ago
- Conversions between various OCR formats ☆84 · Feb 13, 2026 · Updated 3 weeks ago
- Miscellaneous data analysis tools and scripts for the EHRI project ☆16 · Jan 25, 2024 · Updated 2 years ago
- A repository for online OCRD training infrastructure. ☆13 · Aug 20, 2020 · Updated 5 years ago
- ☆14 · Jul 11, 2022 · Updated 3 years ago
- Web service for creating and hosting IIIF manifests from METS/MODS documents ☆36 · Dec 8, 2022 · Updated 3 years ago
- Obsolete repo, merged into eynollah ☆12 · Sep 29, 2025 · Updated 5 months ago
- Named Entity Recognition ☆19 · Feb 13, 2026 · Updated 3 weeks ago
- Named Entity Recognition API used by TEI Publisher ☆21 · May 21, 2024 · Updated last year
- OCR-D wrapper for detectron2 based segmentation models ☆17 · May 1, 2025 · Updated 10 months ago
- The main TEI Publisher app ☆78 · Updated this week
- Double-checked Gold Standard Data for Training and Testing OCR Engines ☆21 · Dec 31, 2022 · Updated 3 years ago
- Validator for the Presentation API ☆47 · Updated this week
- SEM, a free NLP tool relying on machine learning technologies, especially CRFs. ☆23 · Dec 1, 2021 · Updated 4 years ago
- A CLI tool that generates IIIF Presentation 2.1 Manifests from METS/MODS ☆24 · Apr 17, 2025 · Updated 10 months ago
- Text collections made available by the CLiGS group. ☆24 · Mar 22, 2022 · Updated 3 years ago
- Java Domain Models for all IIIF APIs ☆28 · Feb 21, 2026 · Updated 2 weeks ago
- A codebase to support a pure JSON search engine requiring no backend for any XHTML5 document collection ☆63 · Feb 23, 2026 · Updated 2 weeks ago
- Mannheim library utilities ☆27 · Dec 29, 2025 · Updated 2 months ago
- An OCR evaluation tool ☆69 · Aug 22, 2025 · Updated 6 months ago
- EFES (EpiDoc Front End Services) is a custom and readily customizable platform for publication and search/indexing of EpiDoc files, based… ☆33 · Feb 5, 2026 · Updated last month
- ConedaKOR – store.manage.retrieve. ☆31 · Feb 26, 2026 · Updated last week
- TIFY is a slim and mobile-friendly IIIF document viewer. ☆123 · Feb 28, 2026 · Updated last week
- Master repository which includes most other OCR-D repositories as submodules ☆72 · Jul 4, 2025 · Updated 8 months ago
- Django web application to display, annotate, and export digitized books. ☆33 · Updated this week
- You Actually Look Twice At it ☆39 · Jan 21, 2025 · Updated last year
- A crowdsourcing website reassembling the social network of early modern Britain ☆34 · Nov 6, 2018 · Updated 7 years ago