METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)
☆56May 30, 2023Updated 2 years ago
Alternatives and similar repositories for nautilusocr
Users that are interested in nautilusocr are comparing it to the libraries listed below
Sorting:
- CERberus -- guardian against character errors☆29Feb 15, 2024Updated 2 years ago
- Conversions between various OCR formats☆83Feb 13, 2026Updated 2 weeks ago
- Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis☆13Aug 21, 2025Updated 6 months ago
- Library to parse and create METS files, especially for Archivematica.☆23Feb 3, 2026Updated 3 weeks ago
- ☆14Jul 11, 2022Updated 3 years ago
- Discovering IIIF manifests☆19May 16, 2023Updated 2 years ago
- IIIF Examples and useful code☆20Sep 10, 2025Updated 5 months ago
- Self hosting code for Recogito-Studio☆20Oct 16, 2025Updated 4 months ago
- Tools for normalizing the use of some characters and checking file consistencies☆11Jan 12, 2026Updated last month
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 5 years ago
- An extensible viewer for OCR-D mets.xml files☆22May 30, 2024Updated last year
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Feb 6, 2026Updated 3 weeks ago
- Layout analysis to find layout elements in documents (similar to P2PaLA)☆20Updated this week
- Repository hosting the common code for the entity-fishing clients☆10Jun 10, 2025Updated 8 months ago
- A CLI tool that generates IIIF Presentation 2.1 Manifests from METS/MODS☆24Apr 17, 2025Updated 10 months ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Mar 13, 2019Updated 6 years ago
- Web service for creating and hosting IIIF manifests from METS/MODS documents☆36Dec 8, 2022Updated 3 years ago
- Named Entity Recognition tool for Europeana Newspapers☆14Apr 5, 2018Updated 7 years ago
- OCR a IIIF images in a manifest and generate annotations☆26Feb 11, 2025Updated last year
- OCR-D python tools☆33Aug 16, 2024Updated last year
- OCR-D post-correction module based on weighted finite-state transducers☆11Jan 13, 2024Updated 2 years ago
- Automated listing of repos in GitHub with XML files containing teiHeader. Find a project using TEI today!☆17Updated this week
- Named entity annotation tool☆28Jul 6, 2023Updated 2 years ago
- Named Entity Recognition☆19Feb 13, 2026Updated 2 weeks ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Oct 16, 2024Updated last year
- OCR-D wrapper for detectron2 based segmentation models☆17May 1, 2025Updated 10 months ago
- Layout Analysis Dataset with Segmonto (LADaS)☆24Jul 12, 2025Updated 7 months ago
- Ergonomic line-by-line transcription of scanned text.☆54Feb 2, 2026Updated last month
- You Actually Look Twice At it☆38Jan 21, 2025Updated last year
- A platform for the display, enrichment, and curation of IIIF-based digital objects☆56Updated this week
- Web application for transcribing OCR ground truth from Archive.org☆17Feb 22, 2018Updated 8 years ago
- ☆141Mar 5, 2024Updated last year
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Dec 11, 2020Updated 5 years ago
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- A web annotation server built with the same principles as Git☆44Feb 18, 2026Updated last week
- OCR post correction for old German corpus☆19Aug 29, 2022Updated 3 years ago
- Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho…☆22Sep 2, 2022Updated 3 years ago
- Python module for easing the construction of JSON manifests compliant with IIIF API 3.0.☆20Jan 21, 2026Updated last month
- Highlighting various OCR formats directly in Solr☆87Updated this week