sarahalang/LLM-powered-OCR-correction

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sarahalang/LLM-powered-OCR-correction)

sarahalang / LLM-powered-OCR-correction

This repo contains files downloaded from Transkribus with corresponding suggested OCR improvements (performed using ChatGPT AI).

☆20

Alternatives and similar repositories for LLM-powered-OCR-correction

Users that are interested in LLM-powered-OCR-correction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ieg-dhr / NLP-Course4Humanities_2024
View on GitHub
This repository is part of an NLP course for humanities and cultural studies. This course uses historical newspapers as a source and appl…
☆20Jun 5, 2025Updated last year
achimrabus / polyscriptor
View on GitHub
Multi-engine ATR for multiple languages and scripts
☆17Jul 10, 2026Updated last week
AI-Riksarkivet / htrflow
View on GitHub
HTRflow is the underlying engine for our HTR-pipeline
☆77Apr 9, 2026Updated 3 months ago
aso2101 / prakrit_texts
View on GitHub
Digital texts in Prakrit
☆11Sep 14, 2025Updated 10 months ago
jbaiter / pdiiif
View on GitHub
Create PDFs from IIIF manifests, completely client-side (with server-based fallback for unsupported browsers)
☆50Oct 4, 2025Updated 9 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Pleias / OCRoscope
View on GitHub
Small python package to measure OCR quality and other related metrics.
☆26Feb 19, 2024Updated 2 years ago
tobiasvanderwerff / MetaHTR
View on GitHub
Unofficial implementation of the paper "MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition" by Bhunia et al. (2021).
☆14Jun 22, 2022Updated 4 years ago
cwrc / ontology
View on GitHub
CWRC ontology - primary repository
☆13Jul 8, 2026Updated last week
rodighiero / weather-map
View on GitHub
The Weather Map is a visual model inspired by the synoptic weather charts for the map of controversies.
☆11Feb 7, 2024Updated 2 years ago
ahmedsaeedsaid / OCR-Arabic
View on GitHub
Arabic OCR OCR system for Arabic language that converts images (multi-fonts) of typed text to machine-encoded text. The system currently …
☆11Oct 12, 2021Updated 4 years ago
alix-tz / escriptorium-documentation
View on GitHub
Source code to eScriptorium Documentation's website (powered with Mkdocs)
☆16Jun 1, 2026Updated last month
diyclassics / la_core_web_lg
View on GitHub
spaCy-compatible sm/md/lg/trf core models for Latin, i.e pipeline with POS tagger, morphologizer, lemmatizer, dependency parser, and NER
☆12Aug 26, 2025Updated 10 months ago
SymposiumOrganization / ControllableNeuralSymbolicRegression
View on GitHub
☆13Oct 11, 2023Updated 2 years ago
DISSINET / InkVisitor
View on GitHub
An open-source, browser-based front-end application for the collection of complex structured data from textual resources in history and t…
☆17Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HTR-United / htr-united
View on GitHub
Ground Truth Resources for the HTR of patrimonial documents
☆49Updated this week
araichev / kml2geojson
View on GitHub
Moved to https://codeberg.org/araichev/kml2geojson.
☆16May 5, 2026Updated 2 months ago
saimj7 / Handwritten-Text-Recognition-in-Real-Time
View on GitHub
Testing out HTR-OCR-Text translation using Google's Tesseract engine in real-time.
☆20Oct 6, 2020Updated 5 years ago
onaci / folium-glify-layer
View on GitHub
Folium plugin to provide fast webgl rendering for GeoJSON FeatureCollections
☆15Jun 28, 2024Updated 2 years ago
mittagessen / party
View on GitHub
Page-wise text recognition with lower-supervision line data models
☆54Jun 12, 2026Updated last month
kba / transkribus-to-prima
View on GitHub
Convert Transkribus PAGE-XML to standard PAGE-XML
☆12Dec 10, 2025Updated 7 months ago
cpietsch / vikus-IIIF-generator
View on GitHub
VIKUS IIIF Generator
☆17Oct 28, 2025Updated 8 months ago
Black-JL / Research-Project-Flow
View on GitHub
☆16Jul 9, 2026Updated last week
bbostock / Switchbot_Py_Meter
View on GitHub
Python script to read temperature and humidity from Switchbot Meter and send via MQTT (for Home Assistant etc)
☆20Jan 9, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
erlangen-crm / ecrm
View on GitHub
Erlangen CRM - An OWL implementation of the CIDOC Conceptual Reference Model
☆46Sep 20, 2024Updated last year
IrinaArmstrong / HandwrittenTextRecognition
View on GitHub
Offline Handwritten Text Recognition (HTR) system
☆19Aug 8, 2019Updated 6 years ago
Systemik-Solutions / glycerine-viewer
View on GitHub
A VUE IIIF viewer
☆15Jun 5, 2026Updated last month
morrisalp / taatiknet
View on GitHub
Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.
☆16Jun 27, 2023Updated 3 years ago
madeyoga / Handwritten-Text-Recognition
View on GitHub
Train a Text Recognition CRNN model with Tensorflow2 & Keras & IAM Dataset. Convolutional Recurrent Neural Network. CTC.
☆21May 7, 2020Updated 6 years ago
projectEndings / Endings
View on GitHub
Core repository for the project
☆18Sep 26, 2025Updated 9 months ago
OCR-D / ocrd_pagetopdf
View on GitHub
OCR-D wrapper for prima-pagetopdf
☆10Oct 30, 2025Updated 8 months ago
RongLiu-Leo / 2d-gaussian-splatting
View on GitHub
Supercharge 2DGS with SIBR monitor
☆23Oct 1, 2025Updated 9 months ago
GLAM-Workbench / glam-workbench.github.io
View on GitHub
☆29Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Princeton-CDH / geniza
View on GitHub
version 4.x of the Princeton Geniza Project
☆13Jul 9, 2026Updated last week
uniwue-zpd / PAGETools
View on GitHub
Small collection of PAGE XML related scripts used at the ZPD Würzburg
☆12Aug 2, 2024Updated last year
ym001 / distancia
View on GitHub
The DistanceMetrics package is a comprehensive Python library designed to compute a wide variety of distance metrics between two vectors,…
☆21Sep 25, 2025Updated 9 months ago
aourednik / text2landscape
View on GitHub
Visualize a corpus of texts as a landscape with the aid of text mining, graph visualization and self-organizing maps
☆21Feb 11, 2022Updated 4 years ago
skohub-io / skohub-pages
View on GitHub
☆21Feb 10, 2026Updated 5 months ago
davanstrien / ocr-bench
View on GitHub
Per-collection OCR leaderboards using VLM-as-judge
☆68Updated this week
BOberreither / INTRO
View on GitHub
INTRO - an Intertextual, Interpictorial, and Intermedial Relations Ontology for literary studies
☆17Nov 5, 2025Updated 8 months ago