The CIS OCR PostCorrectionTool
☆44Nov 7, 2022Updated 3 years ago
Alternatives and similar repositories for PoCoTo
Users that are interested in PoCoTo are comparing it to the libraries listed below
Sorting:
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Jul 2, 2021Updated 4 years ago
- Training files for Greek cursive script (in early print)☆15May 26, 2021Updated 4 years ago
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as wel…☆24Jan 30, 2021Updated 5 years ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆23Feb 21, 2018Updated 8 years ago
- Converters for various file formats used for representing OCR☆12Apr 30, 2025Updated 10 months ago
- ☆10Aug 5, 2019Updated 6 years ago
- 'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc. The final training process for lat.traineddata☆13Jan 13, 2016Updated 10 years ago
- Digital Humanities course site☆21Nov 22, 2021Updated 4 years ago
- A set of (string) distance functions written in JavaScript / Python / PHP.☆18Feb 2, 2026Updated last month
- Text-Induced Corpus Clean-up☆20Jun 20, 2023Updated 2 years ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Docker container for ocropus3 OCR system☆12Aug 19, 2018Updated 7 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆28Sep 20, 2021Updated 4 years ago
- Ergonomic line-by-line transcription of scanned text.☆54Feb 2, 2026Updated last month
- Polytonic Greek OCR tool suite based on Ocropus 0.7☆13Jul 5, 2023Updated 2 years ago
- Process, enhance and evaluate multiple OCR output.☆24Dec 2, 2025Updated 3 months ago
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support☆60Apr 16, 2021Updated 4 years ago
- An end-user environment for working with data in the CITE environment—browsing and analyzing texts, viewing objects and images, visualizi…☆15May 5, 2020Updated 5 years ago
- Web application for transcribing OCR ground truth from Archive.org☆17Feb 22, 2018Updated 8 years ago
- Tools for TICCL☆14Dec 12, 2025Updated 3 months ago
- Turn CTS TEI corpora into CEX collection files☆12Jun 16, 2021Updated 4 years ago
- Miscellaneous Jupyter notebooks and slides for public talks☆11Jan 7, 2019Updated 7 years ago
- A bunch of modules that use/extend CLTK in order to work with Greek and Latin corpora maintained by the Perseus DL☆12Oct 26, 2019Updated 6 years ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 4 months ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆195Mar 14, 2026Updated last week
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆59Sep 25, 2025Updated 5 months ago
- Named Entity Recognition tool for Europeana Newspapers☆14Apr 5, 2018Updated 7 years ago
- JS / Python3 / PHP Lib to work with UTF8 polytonic greek and latin☆10Sep 11, 2024Updated last year
- An expandable and scalable OCR pipeline☆90Nov 14, 2017Updated 8 years ago
- Mannheim library utilities☆27Dec 29, 2025Updated 2 months ago
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Apr 27, 2020Updated 5 years ago
- OCR-D post-correction with encoder-attention-decoder LSTMs☆13May 1, 2025Updated 10 months ago
- Greek texts (eventually) with linguistic annotation (for Greek Learner Texts Project)☆15Jun 16, 2023Updated 2 years ago
- IIIF Presentation API implementation in Python☆35Apr 17, 2024Updated last year
- Visual Text Analytics for Digital Humanities☆17Apr 22, 2015Updated 10 years ago
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆37Mar 26, 2019Updated 6 years ago
- Polytonic Greek OCR engine derived from Gamera and based on the work of Dalitz and Brandt☆33Nov 25, 2014Updated 11 years ago
- Teaching materials for AR Methodological Workshop - Computational Background Skills for Digital Humanities at University of Vienna.☆43Mar 5, 2026Updated 2 weeks ago