OpenPhilology/nidaba

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenPhilology/nidaba)

OpenPhilology / nidaba

An expandable and scalable OCR pipeline

☆90

Alternatives and similar repositories for nidaba

Users that are interested in nidaba are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ASVLeipzig / cor-asv-fst
View on GitHub
OCR-D post-correction module based on weighted finite-state transducers
☆11Jan 13, 2024Updated 2 years ago
cisocrgroup / Resources
View on GitHub
Manuals, lexica, OCR test data for PoCoTo and the profiler
☆15Jul 2, 2021Updated 5 years ago
ryanfb / ancientgreekocr-ocr-evaluation-tools
View on GitHub
'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.
☆23Feb 21, 2018Updated 8 years ago
ocropus-archive / DUP-ocropy2
View on GitHub
Next generation OCR engine based on LSTMs.
☆51Apr 8, 2018Updated 8 years ago
seuretm / ocrd_typegroups_classifier
View on GitHub
☆10Mar 16, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jbaiter / archiscribe
View on GitHub
Web application for transcribing OCR ground truth from Archive.org
☆18Feb 22, 2018Updated 8 years ago
cisocrgroup / ocrd_cis
View on GitHub
OCR-D python tools
☆33Aug 16, 2024Updated last year
OCR-D / ocrd_anybaseocr
View on GitHub
DFKI Layout Detection for OCR-D
☆47May 1, 2025Updated last year
cisocrgroup / PoCoTo
View on GitHub
The CIS OCR PostCorrectionTool
☆45Nov 7, 2022Updated 3 years ago
kitodo / kitodo-publication
View on GitHub
Kitodo.Publication
☆14Updated this week
Early-Modern-OCR / hOCR-De-Noising
View on GitHub
code to remove "noise" from hOCR output of Tesseract OCR.
☆14Oct 24, 2016Updated 9 years ago
distributed-text-services / distributed-text-services.github.io
View on GitHub
☆11Feb 13, 2026Updated 5 months ago
mauvilsa / tesseract-recognize
View on GitHub
Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format
☆47Mar 31, 2025Updated last year
hnesk / browse-ocrd
View on GitHub
An extensible viewer for OCR-D mets.xml files
☆23May 30, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
internetarchive / archive-hocr-tools
View on GitHub
Efficient hOCR tooling
☆57Aug 18, 2025Updated 11 months ago
wiseman / energid_nlp
View on GitHub
Natural language parsers and conceptual memory
☆15Aug 2, 2012Updated 13 years ago
PRImA-Research-Lab / prima-core-libs
View on GitHub
Core libraries by the PRImA Research Lab
☆16Jul 30, 2024Updated last year
UB-Mannheim / ocr-gt-tools
View on GitHub
Ergonomic line-by-line transcription of scanned text.
☆53Feb 2, 2026Updated 5 months ago
brobertson / ciaconna
View on GitHub
Polytonic Greek OCR tool suite based on Ocropus 0.7
☆13Jul 5, 2023Updated 3 years ago
mittagessen / kraken
View on GitHub
OCR engine for all the languages
☆1,039Updated this week
tmbarchive / ocropus3-docker
View on GitHub
Docker container for ocropus3 OCR system
☆13Aug 19, 2018Updated 7 years ago
andbue / nashi
View on GitHub
Some bits of javascript to transcribe scanned pages using PageXML
☆17May 27, 2026Updated last month
Doreenruirui / okralact
View on GitHub
A repository for online OCRD training infrastructure.
☆13Aug 20, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
UB-Mannheim / GTCheck
View on GitHub
Check your modified Ground Truth files with visual support!
☆10Jan 31, 2024Updated 2 years ago
TEIC / Hackathon
View on GitHub
Scripts, data and results for TEI Hackathon
☆12Oct 31, 2015Updated 10 years ago
filak / hOCR-to-ALTO
View on GitHub
Convert between Tesseract hOCR and ALTO XML using XSL stylesheets
☆60Mar 20, 2026Updated 4 months ago
smurp / huviz
View on GitHub
interactive, customizable semantic web visualization
☆15Dec 27, 2025Updated 6 months ago
kitodo / kitodo-presentation
View on GitHub
Kitodo.Presentation is a feature-rich framework for building a METS- or IIIF-based digital library. It is part of the Kitodo Digital Libr…
☆44Updated this week
benedikt-budig / glyph-miner
View on GitHub
Glyph Miner, a system for extracting glyphs from early typeset prints
☆34Sep 29, 2016Updated 9 years ago
ASVLeipzig / cor-asv-ann
View on GitHub
OCR-D post-correction with encoder-attention-decoder LSTMs
☆13May 1, 2025Updated last year
zamazan4ik / PRLib
View on GitHub
Pre-Recognition Library - library with algorithms for improving OCR quality.
☆38Mar 20, 2021Updated 5 years ago
bertsky / ocrd_detectron2
View on GitHub
OCR-D wrapper for detectron2 based segmentation models
☆16May 1, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
cneud / ocr-conversion
View on GitHub
Conversions between various OCR formats
☆84Feb 13, 2026Updated 5 months ago
NCSU-Libraries / ocracoke
View on GitHub
Rails application supporting the creation of OCR and the IIIF Content Search API
☆33Dec 14, 2022Updated 3 years ago
tmallon / morpheus
View on GitHub
Transform Greek and Latin texts into morphology databases using Perseus' Morpheus service.
☆17Aug 8, 2014Updated 11 years ago
pharos-alexandria / ocr-greek_cursive
View on GitHub
Training files for Greek cursive script (in early print)
☆15May 26, 2021Updated 5 years ago
mauvilsa / nw-page-editor
View on GitHub
Simple app for visual editing of Page XML files
☆31Sep 25, 2025Updated 10 months ago
CITlabRostock / citlab-article-separation-new
View on GitHub
Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho…
☆22Sep 2, 2022Updated 3 years ago
qurator-spk / dinglehopper
View on GitHub
An OCR evaluation tool
☆70Aug 22, 2025Updated 11 months ago