shrutirij/ocr-post-correction

☆141

Alternatives and similar repositories for ocr-post-correction

Users who are interested in ocr-post-correction are comparing it to the libraries listed below.

jarobyte91 / post_ocr_correction
Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"
☆39 · Dec 2, 2023 · Updated 2 years ago
ruathudo / post-ocr-correction
☆11 · Nov 14, 2021 · Updated 4 years ago
mittagessen / curt
☆15 · Jul 11, 2022 · Updated 4 years ago
nrc-cnrc / gramble
Domain-specific programming language for linguistic grammars and transducers — Langage dédié pour les grammaires linguistiques et les tra…
☆17 · Updated this week
kba / transkribus-to-prima
Convert Transkribus PAGE-XML to standard PAGE-XML
☆12 · Dec 10, 2025 · Updated 7 months ago
GarfieldLyu / OCR_POST_DE
OCR post correction for old German corpus
☆20 · Aug 29, 2022 · Updated 3 years ago
qurator-spk / sbb_ner
Named Entity Recognition
☆19 · Feb 13, 2026 · Updated 5 months ago
omni-us / pagexml
Library in C++ and a python wrapper for dealing with Page XML files
☆13 · Apr 25, 2025 · Updated last year
bertsky / ocrd_detectron2
OCR-D wrapper for detectron2 based segmentation models
☆16 · May 1, 2025 · Updated last year
PonteIneptique / YALTAi
You Actually Look Twice At it
☆42 · Apr 15, 2026 · Updated 3 months ago
natliblux / nautilusocr
METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)
☆56 · May 30, 2023 · Updated 3 years ago
ryanfb / ancientgreekocr-ocr-evaluation-tools
'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.
☆23 · Feb 21, 2018 · Updated 8 years ago
hnesk / browse-ocrd
An extensible viewer for OCR-D mets.xml files
☆23 · May 30, 2024 · Updated 2 years ago
andbue / nashi
Some bits of javascript to transcribe scanned pages using PageXML
☆17 · May 27, 2026 · Updated last month
knaw-huc / pagexml
☆17 · Jan 16, 2026 · Updated 6 months ago
FactoDeepLearning / DAN
☆12 · Jun 13, 2025 · Updated last year
jbaiter / pdiiif
Create PDFs from IIIF manifests, completely client-side (with server-based fallback for unsupported browsers)
☆50 · Oct 4, 2025 · Updated 9 months ago
mikahama / natas
Python 3 library for processing historical English
☆68 · Aug 10, 2024 · Updated last year
KBNLresearch / ochre
Toolbox for OCR post-correction
☆120 · Sep 19, 2019 · Updated 6 years ago
ltgoslo / simple_elmo_training
Minimal code to train ELMo models in recent versions of TensorFlow
☆14 · Jun 16, 2026 · Updated last month
tzhengus / ManchuDict
A simple dictionary in Manchu, Chinese and English.
☆14 · Feb 27, 2015 · Updated 11 years ago
ocropus-archive / DUP-cctc
Simple CTC implementation for PyTorch
☆14 · Oct 25, 2017 · Updated 8 years ago
JKamlah / tesseractXplore
tesseractXplore a tesseract ease of use gui with full control
☆26 · Nov 10, 2021 · Updated 4 years ago
OCR-D / page-to-alto
Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)
☆17 · Jun 5, 2026 · Updated last month
impresso / named-entity-tutorial-dh2019
Tutorial on NE processing for Digital Humanities - DH Utrecht 2019
☆24 · Jul 18, 2019 · Updated 7 years ago
soskuthy / gamm_strategies
Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis"
☆10 · Jan 25, 2021 · Updated 5 years ago
UniversalDependencies / UD_German-GSD
☆20 · May 6, 2026 · Updated 2 months ago
OCR-D / ocrd_anybaseocr
DFKI Layout Detection for OCR-D
☆47 · May 1, 2025 · Updated last year
antonisa / embeddings
Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages
☆15 · Apr 11, 2020 · Updated 6 years ago
ehsanasgari / 1000Langs
Creating super-parallel corpora of more than 1500+ unique languages for NLP research
☆33 · Dec 8, 2022 · Updated 3 years ago
kmike / dialog2017
☆10 · Jul 21, 2017 · Updated 9 years ago
Living-with-machines / DeezyMatch
A Flexible Deep Learning Approach to Fuzzy String Matching
☆152 · Oct 16, 2024 · Updated last year
cindyxinyiwang / expand-via-lexicon-based-adaptation
Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"
☆29 · Apr 2, 2022 · Updated 4 years ago
qurator-spk / neat
Named entity annotation tool
☆28 · Jul 6, 2023 · Updated 3 years ago
ahmetustun / udapter
UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…
☆31 · Dec 5, 2022 · Updated 3 years ago
dasmiq / cs7180-sp2024
Special Topics in AI: Artificial Intelligence as an Archival Science
☆21 · May 13, 2024 · Updated 2 years ago
youhyunjo / manchu-spell
Manchu Spell Checker
☆17 · Dec 6, 2024 · Updated last year
qurator-spk / sbb_ocr_postcorrection
Two-Step Approach to OCR Post-Correction
☆14 · May 24, 2024 · Updated 2 years ago
sanjibnarzary / awesome-llm
Curated list of open source and openly accessible large language models
☆26 · Jul 16, 2023 · Updated 3 years ago