tberg12/ocular

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tberg12/ocular)

tberg12 / ocular

Ocular is a state-of-the-art historical OCR system.

☆270

Alternatives and similar repositories for ocular

Users that are interested in ocular are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ryanfb / book-aligner
View on GitHub
Automatic alignment of books between HathiTrust, Internet Archive, Google Books, etc.
☆37May 8, 2026Updated 2 months ago
wolfgangmm / tei-simple-pm
View on GitHub
An implementation of the TEI Simple ODD extensions for processing models in XQuery.
☆22Jul 24, 2019Updated 7 years ago
bengler / propinquity
View on GitHub
Pipeline for image classification at The Norwegian National Museum and zooming display mechanism.
☆14Nov 3, 2017Updated 8 years ago
seuretm / ocrd_typegroups_classifier
View on GitHub
☆10Mar 16, 2023Updated 3 years ago
jze / ocropus-model_fraktur
View on GitHub
OCRopus model for Gothic print (Fraktur)
☆19Feb 16, 2020Updated 6 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
textcreationpartnership / Texts
View on GitHub
the EEBO TCP texts
☆37Feb 21, 2018Updated 8 years ago
iulibdcs / tei_text
View on GitHub
Free-for-all repository of TEI and plain text files for you (to do cool stuff) provided by the Digital Collections Services group at the …
☆26Apr 19, 2017Updated 9 years ago
UB-Mannheim / ocr-gt-tools
View on GitHub
Ergonomic line-by-line transcription of scanned text.
☆53Feb 2, 2026Updated 5 months ago
ryanfb / ancientgreekocr-ocr-evaluation-tools
View on GitHub
'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.
☆23Feb 21, 2018Updated 8 years ago
Early-Modern-OCR / hOCR-De-Noising
View on GitHub
code to remove "noise" from hOCR output of Tesseract OCR.
☆14Oct 24, 2016Updated 9 years ago
dhlab-epfl / dhSegment
View on GitHub
Generic framework for historical document processing
☆383Jul 9, 2021Updated 5 years ago
OpenPhilology / nidaba
View on GitHub
An expandable and scalable OCR pipeline
☆90Nov 14, 2017Updated 8 years ago
Early-Modern-OCR / TesseractTraining
View on GitHub
Training files produced for and by the Tesseract OCR engine for work on the Early Modern OCR Project (eMOP)
☆37Sep 24, 2015Updated 10 years ago
ocropus-archive / DUP-ocropy
View on GitHub
Python-based tools for document analysis and OCR
☆3,466May 22, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ocropus / hocr-tools
View on GitHub
Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
☆416Aug 10, 2024Updated last year
mdlincoln / ulanr
View on GitHub
Reconcile artist names to the Getty Union List of Artist Names
☆20Oct 10, 2016Updated 9 years ago
CITlabRostock / citlab-article-separation-new
View on GitHub
Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho…
☆22Sep 2, 2022Updated 3 years ago
jronallo / iiif-image
View on GitHub
Node modules for working with the IIIF Image API
☆15Aug 26, 2016Updated 9 years ago
armadillo-systems / inquire
View on GitHub
iNQUIRE is a digital research platform, designed to surface any digital repository using any metadata schema. Coded in HTML5, and leverag…
☆11Mar 3, 2023Updated 3 years ago
iiif-prezi / osullivan
View on GitHub
IIIF Presentation API for Ruby
☆33Dec 10, 2025Updated 7 months ago
oxygenxml / TEI-Facsimile-Plugin
View on GitHub
A plugin that provides support for working with Digital Facsimiles in Text Encoding Initiative (TEI) vocabulary. The plugin contribute…
☆25Jun 16, 2025Updated last year
tedunderwood / DataMunging
View on GitHub
Scripts that clean up OCR and munge Hathi metadata.
☆78Nov 4, 2017Updated 8 years ago
arttracks / elysa
View on GitHub
CMOA Provenance entry tool
☆15Aug 1, 2015Updated 10 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
PRImA-Research-Lab / prima-core-libs
View on GitHub
Core libraries by the PRImA Research Lab
☆16Jul 30, 2024Updated last year
benedikt-budig / glyph-miner
View on GitHub
Glyph Miner, a system for extracting glyphs from early typeset prints
☆34Sep 29, 2016Updated 9 years ago
glenrobson / SimpleAnnotationServer
View on GitHub
A simple IIIF and Mirador compatible Annotation Server
☆104Mar 1, 2026Updated 4 months ago
cisocrgroup / OCR-Workshop
View on GitHub
Presentations, tutorials and data for the OCR workshop at LMU
☆16Jun 2, 2017Updated 9 years ago
instituutnederlandsetaal / OpenConvert
View on GitHub
Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)
☆23Feb 11, 2022Updated 4 years ago
allofthenorthwood / synesthesia
View on GitHub
An app for viewing grapheme-to-color synesthesia sets (if you have no idea what that means, check Wikipedia! It's pretty cool.)
☆13Oct 16, 2021Updated 4 years ago
kba / awesome-ocr
View on GitHub
Links to awesome OCR projects
☆3,112Jul 6, 2024Updated 2 years ago
dasmiq / passim
View on GitHub
Detect and align similar passages
☆122Apr 27, 2026Updated 2 months ago
NYPL / oral-history
View on GitHub
NYPL Oral History Project
☆16Mar 4, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ocropus-archive / DUP-ocropy2
View on GitHub
Next generation OCR engine based on LSTMs.
☆51Apr 8, 2018Updated 8 years ago
an-rahulpandey / speech-recognition-plugin-ios
View on GitHub
Speech Recognition Plugin for Phonegap based on https://github.com/currycat/SpeechToText.git
☆13Jan 13, 2015Updated 11 years ago
rogerhoward / lambdazoom
View on GitHub
LambdaZoom is a Python-based AWS Lambda function which converts uploaded images to the Deep Zoom tiled image format supported by OpenSead…
☆10Feb 4, 2022Updated 4 years ago
filak / hOCR-to-ALTO
View on GitHub
Convert between Tesseract hOCR and ALTO XML using XSL stylesheets
☆60Mar 20, 2026Updated 4 months ago
ryanfb / iiif-dl
View on GitHub
Command-line tile downloader/assembler for IIIF endpoints/manifests
☆35Apr 8, 2026Updated 3 months ago
JonathanReeve / corpus-list
View on GitHub
A structured list of text corpora, created for use with a corpus downloader.
☆13Aug 27, 2016Updated 9 years ago
KBNLresearch / ochre
View on GitHub
Toolbox for OCR post-correction
☆120Sep 19, 2019Updated 6 years ago