GarfieldLyu / OCR_POST_DE
OCR post correction for old German corpus
☆19 · Aug 29, 2022 · Updated 3 years ago
Alternatives and similar repositories for OCR_POST_DE
Users who are interested in OCR_POST_DE are comparing it to the libraries listed below.
- Repository for "Towards Robust Named Entity Recognition for Historic German" ☆18 · Dec 11, 2020 · Updated 5 years ago
- ☆14 · Jul 11, 2022 · Updated 3 years ago
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800) ☆23 · Apr 23, 2024 · Updated last year
- Convert Transkribus PAGE-XML to standard PAGE-XML ☆12 · Dec 10, 2025 · Updated 2 months ago
- Neo4j powered web application for multimedia collections: bring graph-based exploration and crowd-based indexation. ☆24 · May 1, 2020 · Updated 5 years ago
- ☆11 · Nov 14, 2021 · Updated 4 years ago
- GloSAT Historical Measurement Table Dataset ☆11 · Dec 3, 2025 · Updated 2 months ago
- ☆10 · Aug 5, 2019 · Updated 6 years ago
- ☆10 · Mar 16, 2023 · Updated 2 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19… ☆16 · Oct 18, 2024 · Updated last year
- QualiAnon is a tool to support the anonymization of text data. It is developed by the Qualiservice research data center for the anonymiza… ☆33 · May 27, 2025 · Updated 8 months ago
- Fork of dhSegment for experiments on visual and textual feature combination. ☆15 · Jan 30, 2021 · Updated 5 years ago
- Some bits of javascript to transcribe scanned pages using PageXML ☆17 · Mar 18, 2024 · Updated last year
- A repository for online OCRD training infrastructure. ☆13 · Aug 20, 2020 · Updated 5 years ago
- OCR-D post-correction module based on weighted finite-state transducers ☆11 · Jan 13, 2024 · Updated 2 years ago
- ☆15 · Jan 9, 2019 · Updated 7 years ago
- Named Entity Recognition ☆18 · Apr 9, 2025 · Updated 10 months ago
- Coreference resolution for German ☆16 · Jun 26, 2017 · Updated 8 years ago
- OCR-D wrapper for detectron2 based segmentation models ☆17 · May 1, 2025 · Updated 9 months ago
- Double-checked Gold Standard Data for Training and Testing OCR Engines ☆21 · Dec 31, 2022 · Updated 3 years ago
- A generic webservice to extract RDF statements from XML resources ☆19 · May 21, 2024 · Updated last year
- An extensible viewer for OCR-D mets.xml files ☆22 · May 30, 2024 · Updated last year
- Data Mining Historical Newspaper Metadata (METS/ALTO formats) ☆25 · Feb 6, 2026 · Updated last week
- Data for the HIPE 2022 shared task. ☆21 · Nov 29, 2023 · Updated 2 years ago
- (ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper ☆88 · May 25, 2023 · Updated 2 years ago
- DFKI Layout Detection for OCR-D ☆47 · May 1, 2025 · Updated 9 months ago
- tesseractXplore a tesseract ease of use gui with full control ☆27 · Nov 10, 2021 · Updated 4 years ago
- CERberus -- guardian against character errors ☆29 · Feb 15, 2024 · Updated last year
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German ☆26 · May 10, 2021 · Updated 4 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project ☆27 · Mar 21, 2022 · Updated 3 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL) ☆56 · May 30, 2023 · Updated 2 years ago
- Named entity annotation tool ☆28 · Jul 6, 2023 · Updated 2 years ago
- An OCR evaluation tool ☆69 · Aug 22, 2025 · Updated 5 months ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts ☆28 · Sep 20, 2021 · Updated 4 years ago
- Writing Observer and Learning Observer: A system for monitoring learning process data, with an initial focus on writing process data from… ☆12 · Updated this week
- Toolbox for OCR post-correction ☆122 · Sep 19, 2019 · Updated 6 years ago
- the EEBO TCP texts ☆36 · Feb 21, 2018 · Updated 7 years ago
- A Pythonic API and some command line tools to access the Transkribus server via its REST API ☆28 · Nov 25, 2022 · Updated 3 years ago
- Blazing fast topic modelling for short texts. ☆35 · Jan 5, 2026 · Updated last month