pjox / gutfLinks
Terminal tool that converts files encoding to UTF-8
☆10Updated 6 years ago
Alternatives and similar repositories for gutf
Users that are interested in gutf are comparing it to the libraries listed below
Sorting:
- Named Entity Recognition☆18Updated 10 months ago
- ☆33Updated 3 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆56Updated 2 years ago
- Repository hosting the common code for the entity-fishing clients☆10Updated 7 months ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 5 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated 2 years ago
- Conversions between various OCR formats☆82Updated 2 years ago
- Python tools for performing various operations on ALTO XML files☆48Updated 11 months ago
- Named entity annotation tool☆28Updated 2 years ago
- You Actually Look Twice At it☆38Updated last year
- An OCR evaluation tool☆68Updated 5 months ago
- OCR-D python tools☆33Updated last year
- A context-based spellchecker for correcting OCR output.☆21Updated 3 years ago
- A Named-Entity Recogniser based on Grobid.☆54Updated 8 months ago
- Web application for transcribing OCR ground truth from Archive.org☆17Updated 7 years ago
- Data from the Integrating Digital Papyrology project☆74Updated this week
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆24Updated 2 years ago
- DTA Base Format (DTABf)☆18Updated 10 months ago
- Process, enhance and evaluate multiple OCR output.☆24Updated 2 months ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆59Updated 4 months ago
- Ground Truth Resources for the HTR of patrimonial documents☆47Updated last week
- Multi Tier Annotation Search☆26Updated 4 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆21Updated last year
- tesseractXplore a tesseract ease of use gui with full control☆26Updated 4 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆199Updated 8 months ago
- Master repository which includes most other OCR-D repositories as submodules☆72Updated 7 months ago
- Repository for the book Among Digitized Manuscripts by L.W. Cornelis van Lit (Leiden: Brill, 2020)☆25Updated 5 years ago
- Detect and align similar passages☆117Updated 4 months ago
- A python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer).☆15Updated last year
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆28Updated 4 years ago