sarahalang / LLM-powered-OCR-correctionLinks
This repo contains files downloaded from Transkribus with corresponding suggested OCR improvements (performed using ChatGPT AI).
☆17Updated 11 months ago
Alternatives and similar repositories for LLM-powered-OCR-correction
Users that are interested in LLM-powered-OCR-correction are comparing it to the libraries listed below
Sorting:
- Ground Truth Resources for the HTR of patrimonial documents☆46Updated last week
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆56Updated 2 years ago
- ☆51Updated last year
- Named Entity Recognition API used by TEI Publisher☆21Updated last year
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Updated last year
- Conversions between various OCR formats☆82Updated 2 years ago
- OCR-D compliant toolset for optical layout recognition on historical german-language documents published in Brazil☆11Updated 4 years ago
- HTRflow is the underlying engine for our HTR-pipeline☆69Updated last month
- Page-wise text recognition with lower-supervision line data models☆50Updated last week
- ☆39Updated last year
- Knowledge graph construction: Fast inserts into a Wikibase instance☆46Updated 3 years ago
- An OCR evaluation tool☆68Updated 5 months ago
- Instructions, exercises and example data sets for Annif hands-on tutorial☆43Updated 2 months ago
- Web application to build XML stand-off markup☆15Updated 4 years ago
- TEI Transviewer is an interface intended to the exploration of primary and secondary sources, at the document level, in historical or oth…☆14Updated 4 years ago
- You Actually Look Twice At it☆38Updated last year
- Named entity annotation tool☆28Updated 2 years ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.☆249Updated last week
- The main TEI Publisher app☆78Updated last week
- Python tools for performing various operations on ALTO XML files☆48Updated 10 months ago
- Named Entity Recognition☆18Updated 9 months ago
- Software for the development of EditionCrafter, digital critical edition publication tool☆21Updated 2 months ago
- A list of awesome AI in libraries, archives, and museum collections from around the world 🕶️☆155Updated last year
- Documentation and use cases for ALTO XML☆41Updated 7 years ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Updated last year
- A DH abstracts conversion tool☆12Updated 10 months ago
- Specifications for the DTS API☆32Updated last month
- This repo work as a sandbox enviroment for htrflow.☆38Updated 10 months ago
- Miscellaneous data analysis tools and scripts for the EHRI project☆15Updated last year
- An image annotation environment for the MARKUS platform☆54Updated last month