ja-mcm / OCRfixr
A context-based spellchecker for correcting OCR output.
☆18Updated last year
Related projects
Alternatives and complementary repositories for OCRfixr
- Python 3 library for processing historical English☆64Updated 3 months ago
- A fully-fledged PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆22Updated last year
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 3 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆52Updated last year
- A Named-Entity Recogniser based on Grobid.☆49Updated last month
- ☆32Updated last year
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated last year
- UIMA CAS processing library written in Python☆85Updated 6 months ago
- Named Entity Recognition☆16Updated this week
- A python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer).☆13Updated 5 months ago
- Tutorial on NE processing for Digital Humanities - DH Utrecht 2019☆25Updated 5 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 3 months ago
- Python Multilingual Ucrel Semantic Analysis System☆30Updated 2 months ago
- You Actually Look Twice At it☆29Updated last month
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆38Updated 5 years ago
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆22Updated 6 months ago
- A suite of batches and tools for OCR tasks.☆71Updated last year
- BERT and ELECTRA models trained on Europeana Newspapers☆36Updated 2 years ago
- Data for the HIPE 2022 shared task.☆15Updated 11 months ago
- An easy-to-use API for analyzing INCEpTION annotation projects.☆16Updated last year
- Python tools for performing various operations on ALTO XML files☆39Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated last year
- OCR post correction for old German corpus☆19Updated 2 years ago
- Digital Humanities Across Borders☆46Updated 7 months ago
- High-performance text aligner for large collections of texts☆45Updated 2 weeks ago
- ☆64Updated last year
- Repository for the Georgetown University Multilayer Corpus (GUM)☆88Updated 2 weeks ago
- A Pythonic API and some command line tools to access the Transkribus server via its REST API☆27Updated last year