pjox / gutfLinks
Terminal tool that converts files encoding to UTF-8
☆10Updated 5 years ago
Alternatives and similar repositories for gutf
Users that are interested in gutf are comparing it to the libraries listed below
Sorting:
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated last year
- ☆32Updated 2 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- Training files for Greek cursive script (in early print)☆14Updated 4 years ago
- Python tools for performing various operations on ALTO XML files☆47Updated 3 months ago
- OCRopus model for Gothic print (Fraktur)☆18Updated 5 years ago
- Named Entity Recognition☆19Updated 2 months ago
- 🕸 GlotWeb: Web Indexing for Low-Resource Languages -- under construction.☆13Updated 2 months ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- You Actually Look Twice At it☆35Updated 4 months ago
- Repository for the book Among Digitized Manuscripts by L.W. Cornelis van Lit (Leiden: Brill, 2020)☆22Updated 5 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆53Updated 2 years ago
- Named entity annotation tool☆28Updated last year
- Multi Tier Annotation Search☆26Updated 4 years ago
- tesseractXplore a tesseract ease of use gui with full control☆22Updated 3 years ago
- Knowledge graph construction: Fast inserts into a Wikibase instance☆45Updated 3 years ago
- Une série de programmes en python permettant de récupérer automatiquement des textes sur Gallica☆41Updated 5 years ago
- Extension for pie to include taggers with their models and pre/postprocessors☆10Updated last year
- A context-based spellchecker for correcting OCR output.☆19Updated 2 years ago
- QA-tool for scans with corresponding ALTO-files☆24Updated 2 years ago
- Ancient Greek lemmatisation tool☆22Updated 3 years ago
- In-browser OCR of Ancient Greek and Latin☆26Updated last month
- Wrapper for DKPro Core to extract lingustic information from books.☆16Updated 3 years ago
- Python IMage MIning☆14Updated 2 months ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆55Updated 2 weeks ago
- an experimental implementation of Burrow's delta in Python 3☆21Updated 3 years ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35Updated 2 years ago
- OCR-D python tools☆33Updated 9 months ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- Command-line tile downloader/assembler for IIIF endpoints/manifests☆35Updated 3 years ago