OCR-D/ocrd_segment

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OCR-D/ocrd_segment)

OCR-D / ocrd_segment

OCR-D-compliant page segmentation

☆67

Alternatives and similar repositories for ocrd_segment

Users that are interested in ocrd_segment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OCR-D / ocrd_anybaseocr
View on GitHub
DFKI Layout Detection for OCR-D
☆47May 1, 2025Updated last year
bertsky / ocrd_publaynet
View on GitHub
convert PubLayNet data into METS/PAGE-XML
☆10Mar 17, 2020Updated 6 years ago
andbue / nashi
View on GitHub
Some bits of javascript to transcribe scanned pages using PageXML
☆17May 27, 2026Updated last month
qurator-spk / mods4pandas
View on GitHub
Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis
☆15Aug 21, 2025Updated 10 months ago
seuretm / ocrd_typegroups_classifier
View on GitHub
☆10Mar 16, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
qurator-spk / sbb_textline_detection
View on GitHub
Detect textlines in document images
☆90May 27, 2024Updated 2 years ago
OCR-D / ocrd_all
View on GitHub
Master repository which includes most other OCR-D repositories as submodules
☆73Jul 4, 2025Updated last year
jze / ocropus-model_fraktur
View on GitHub
OCRopus model for Gothic print (Fraktur)
☆19Feb 16, 2020Updated 6 years ago
cisocrgroup / ocrd_cis
View on GitHub
OCR-D python tools
☆33Aug 16, 2024Updated last year
mauvilsa / nw-page-editor
View on GitHub
Simple app for visual editing of Page XML files
☆31Sep 25, 2025Updated 9 months ago
ulb-sachsen-anhalt / ulb-zeitungsprojekt-hp1
View on GitHub
Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"
☆12Dec 17, 2021Updated 4 years ago
qurator-spk / sbb_pixelwise_segmentation
View on GitHub
Obsolete repo, merged into eynollah
☆12Sep 29, 2025Updated 9 months ago
hnesk / browse-ocrd
View on GitHub
An extensible viewer for OCR-D mets.xml files
☆23May 30, 2024Updated 2 years ago
OCR4all / LAREX
View on GitHub
A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.
☆198Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mittagessen / curt
View on GitHub
☆15Jul 11, 2022Updated 4 years ago
qurator-spk / sbb_binarization
View on GitHub
Document Image Binarization
☆80Oct 17, 2024Updated last year
kba / transkribus-to-prima
View on GitHub
Convert Transkribus PAGE-XML to standard PAGE-XML
☆12Dec 10, 2025Updated 7 months ago
qurator-spk / dinglehopper
View on GitHub
An OCR evaluation tool
☆70Aug 22, 2025Updated 10 months ago
PRImA-Research-Lab / PAGE-XML
View on GitHub
PAGE XML format collection for document image page content and more
☆71Jan 16, 2026Updated 6 months ago
cneud / alto-tools
View on GitHub
Python tools for performing various operations on ALTO XML files
☆50Jun 12, 2026Updated last month
omni-us / pagexml
View on GitHub
Library in C++ and a python wrapper for dealing with Page XML files
☆13Apr 25, 2025Updated last year
OCR-D / ocrd_pagetopdf
View on GitHub
OCR-D wrapper for prima-pagetopdf
☆10Oct 30, 2025Updated 8 months ago
OCR-D / page-to-alto
View on GitHub
Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)
☆17Jun 5, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
glenrobson / iiif_stuff
View on GitHub
IIIF Examples and useful code
☆20Sep 10, 2025Updated 10 months ago
qurator-spk / neat
View on GitHub
Named entity annotation tool
☆28Jul 6, 2023Updated 3 years ago
qurator-spk / eynollah
View on GitHub
Document Layout Analysis
☆407Updated this week
lquirosd / P2PaLA
View on GitHub
Page to PAGE Layout Analysis Tool
☆192Jan 17, 2022Updated 4 years ago
NVlabs / ocrodeg
View on GitHub
document image degradation
☆165May 18, 2020Updated 6 years ago
UB-Mannheim / GTCheck
View on GitHub
Check your modified Ground Truth files with visual support!
☆10Jan 31, 2024Updated 2 years ago
maxnth / LineAug
View on GitHub
Augment line images for improving OCR datasets
☆10Oct 4, 2023Updated 2 years ago
bertsky / ocrd_detectron2
View on GitHub
OCR-D wrapper for detectron2 based segmentation models
☆16May 1, 2025Updated last year
zamazan4ik / PRLib
View on GitHub
Pre-Recognition Library - library with algorithms for improving OCR quality.
☆38Mar 20, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jbaiter / archiscribe
View on GitHub
Web application for transcribing OCR ground truth from Archive.org
☆18Feb 22, 2018Updated 8 years ago
OCR-D / ocrd_calamari
View on GitHub
Recognize text using Calamari OCR and the OCR-D framework
☆16May 13, 2025Updated last year
benedikt-budig / glyph-miner
View on GitHub
Glyph Miner, a system for extracting glyphs from early typeset prints
☆34Sep 29, 2016Updated 9 years ago
DocYard-ai / UCR
View on GitHub
Universal Character Recognizer (UCR): Simple, Intuitive, Extensible, Multi-Lingual OCR engine
☆15Apr 23, 2021Updated 5 years ago
OCR-D / ocrd-website
View on GitHub
☆24Jun 9, 2026Updated last month
impresso / named-entity-tutorial-dh2019
View on GitHub
Tutorial on NE processing for Digital Humanities - DH Utrech 2019
☆24Jul 18, 2019Updated 7 years ago
ajgallego / document-image-binarization
View on GitHub
A selectional auto-encoder approach for document image binarization
☆104Dec 8, 2022Updated 3 years ago