You Actually Look Twice At it
☆39Jan 21, 2025Updated last year
Alternatives and similar repositories for YALTAi
Users that are interested in YALTAi are comparing it to the libraries listed below
Sorting:
- Tools for normalizing the use of some characters and checking file consistencies☆11Jan 12, 2026Updated last month
- Page-wise text recognition with lower-supervision line data models☆51Updated this week
- ☆17Feb 27, 2026Updated last week
- ☆14Jul 11, 2022Updated 3 years ago
- Miscellaneous data analysis tools and scripts for the EHRI project☆16Jan 25, 2024Updated 2 years ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 4 months ago
- ☆11Jun 13, 2025Updated 8 months ago
- SEM, a free NLP tool relying on machine learning technologies, especially CRFs.☆23Dec 1, 2021Updated 4 years ago
- Conversions between various OCR formats☆84Feb 13, 2026Updated 3 weeks ago
- An extensible viewer for OCR-D mets.xml files☆23May 30, 2024Updated last year
- Annotation tool (NER) for XML documents (TEI, EAD) - WIP☆11Jul 22, 2022Updated 3 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 2 months ago
- Ground Truth Resources for the HTR of patrimonial documents☆47Mar 1, 2026Updated last week
- ☆28Sep 26, 2023Updated 2 years ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- GloSAT Historical Measurement Table Dataset☆11Dec 3, 2025Updated 3 months ago
- Combination of the RapidFuzz library with Spacy PhraseMatcher☆11Sep 29, 2021Updated 4 years ago
- Python tools for performing various operations on ALTO XML files☆49Feb 27, 2025Updated last year
- Repository of documentation about the open datasets published by the UK Web Archive.☆15Jun 21, 2019Updated 6 years ago
- Use any vision LLMs to perform OCR using LangChain☆18Jul 29, 2025Updated 7 months ago
- Repository hosting the common code for the entity-fishing clients☆10Jun 10, 2025Updated 8 months ago
- Named entity annotation tool☆28Jul 6, 2023Updated 2 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Dec 17, 2021Updated 4 years ago
- Software for Tagging Entities in TEI-Files automatically☆16Oct 21, 2025Updated 4 months ago
- ☆39Jun 6, 2024Updated last year
- HuCit KB: a knowledge base of classical texts and citable text units.☆11Nov 17, 2021Updated 4 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Oct 18, 2024Updated last year
- LoanPy is a linguistic toolkit for rule-based prediction and evaluation of loanword adaptation and historical reconstructions and can be …☆16Mar 5, 2024Updated 2 years ago
- Images of example pages from Transkribus model training sets to make it easier to find a match.☆14Jan 25, 2022Updated 4 years ago
- Computer vision platform for the Digital Humanities☆26Updated this week
- Automated listing of repos in GitHub with XML files containing teiHeader. Find a project using TEI today!☆17Updated this week
- OCR-D post-correction module based on weighted finite-state transducers☆11Jan 13, 2024Updated 2 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆56May 30, 2023Updated 2 years ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Oct 16, 2024Updated last year
- ☆23Aug 13, 2023Updated 2 years ago
- Code for the DeepScript Submission to ICFHR2016 Competition on the Classification of Medieval Handwritings in Latin Script☆18Nov 23, 2016Updated 9 years ago
- Training files for Greek cursive script (in early print)☆15May 26, 2021Updated 4 years ago
- A repository for illustrating the transformation of a PAGE XML file into XML-TEI format, resulting from experimentations made for the LEC…☆17May 18, 2022Updated 3 years ago
- A python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer).☆15Jun 4, 2024Updated last year