sarahalang / LLM-powered-OCR-correctionLinks
This repo contains files downloaded from Transkribus with corresponding suggested OCR improvements (performed using ChatGPT AI).
☆15Updated 8 months ago
Alternatives and similar repositories for LLM-powered-OCR-correction
Users that are interested in LLM-powered-OCR-correction are comparing it to the libraries listed below
Sorting:
- Ground Truth Resources for the HTR of patrimonial documents☆45Updated last week
 - METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆54Updated 2 years ago
 - Knowledge graph construction: Fast inserts into a Wikibase instance☆46Updated 3 years ago
 - ☆50Updated last year
 - A list of awesome AI in libraries, archives, and museum collections from around the world 🕶️☆142Updated last year
 - An image annotation environment for the MARKUS platform☆48Updated last week
 - HTRflow is the underlying engine for our HTR-pipeline☆65Updated 4 months ago
 - An OCR evaluation tool☆68Updated 2 months ago
 - Named Entity Recognition API used by TEI Publisher☆21Updated last year
 - ☆32Updated 2 years ago
 - Vocabseditor is a web-based tool for collaborative work on controlled vocabularies development☆25Updated last month
 - ☆38Updated last year
 - Named entity annotation tool☆28Updated 2 years ago
 - Named Entity Recognition☆18Updated 6 months ago
 - This repo work as a sandbox enviroment for htrflow.☆38Updated 7 months ago
 - Conversions between various OCR formats☆81Updated 2 years ago
 - Web application to build XML stand-off markup☆15Updated 4 years ago
 - Heritage Connector: Transforming text into data to extract meaning and make connections☆24Updated 2 years ago
 - A web app for creating and editing ODD documents☆27Updated 3 weeks ago
 - Repository hosting the common code for the entity-fishing clients☆10Updated 4 months ago
 - A collection of open source tools and resources related to Wikibase knowledge graphs☆73Updated last month
 - RDF Transform is an extension for OpenRefine to transform data into RDF formats.☆36Updated last month
 - TEI Transviewer is an interface intended to the exploration of primary and secondary sources, at the document level, in historical or oth…☆13Updated 4 years ago
 - nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Updated last year
 - Miscellaneous data analysis tools and scripts for the EHRI project☆15Updated last year
 - Research Environment for Ancient Documents☆43Updated 3 weeks ago
 - Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Updated last year
 - DHLAB is a library of python modules for accessing text and pictures at the National Library of Norway.☆23Updated 3 weeks ago
 - Software for the development of EditionCrafter, digital critical edition publication tool☆20Updated last month
 - Cours de python enseigné à l'École nationale des Chartes☆34Updated 4 years ago