scstech85/DocEmul

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/scstech85/DocEmul)

scstech85 / DocEmul

A Toolkit to Generate Structured Historical Documents

☆15

Alternatives and similar repositories for DocEmul

Users that are interested in DocEmul are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qurator-spk / sbb_ner
View on GitHub
Named Entity Recognition
☆19Feb 13, 2026Updated 5 months ago
hnesk / browse-ocrd
View on GitHub
An extensible viewer for OCR-D mets.xml files
☆23May 30, 2024Updated 2 years ago
taeho-kil / Scene-Text-Rectification
View on GitHub
Scene text rectification using glyph and character alignment properties
☆22Jan 21, 2018Updated 8 years ago
tmbarchive / ocropus3-docker
View on GitHub
Docker container for ocropus3 OCR system
☆12Aug 19, 2018Updated 7 years ago
FactoDeepLearning / DAN
View on GitHub
☆12Jun 13, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ulb-sachsen-anhalt / ulb-zeitungsprojekt-hp1
View on GitHub
Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"
☆12Dec 17, 2021Updated 4 years ago
Sanster / DeepAndroidOcr
View on GitHub
Offline android OCR app using deep learning
☆22Sep 7, 2018Updated 7 years ago
gudovskiy / fmap_compression
View on GitHub
Code for DNN feature map compression paper
☆11Nov 21, 2018Updated 7 years ago
PRImA-Research-Lab / prima-page-to-pdf
View on GitHub
Java command line tool to convert PAGE XML files with layout and text content to PDF
☆10Apr 27, 2020Updated 6 years ago
computervision8 / FSFNet
View on GitHub
☆11Nov 19, 2020Updated 5 years ago
nicokaiser / bandoneon
View on GitHub
A little JavaScript application that wants to help learning the bandoneon.
☆21Updated this week
tmbarchive / ocropus
View on GitHub
The OCRopus OCR System
☆11Dec 17, 2014Updated 11 years ago
OCR4all / getting_started
View on GitHub
guides and test data for OCR4all
☆32Oct 4, 2022Updated 3 years ago
BrunoKrinski / awesome-data-augmentation
View on GitHub
A set of awesome content about Data Augmentation for Deep Learning and other stuff!!!
☆15Nov 27, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
MathieuTurcotte / msparser
View on GitHub
Parser for Valgrind's massif.out file format.
☆20Mar 17, 2013Updated 13 years ago
qurator-spk / sbb_textline_detection
View on GitHub
Detect textlines in document images
☆90May 27, 2024Updated 2 years ago
UB-Mannheim / Fibeln
View on GitHub
Transkriptionen von Fibeln (19. Jahrhundert)
☆11Oct 31, 2025Updated 8 months ago
kba / transkribus-to-prima
View on GitHub
Convert Transkribus PAGE-XML to standard PAGE-XML
☆12Dec 10, 2025Updated 7 months ago
weijiawu / Polygon-free-Unconstrained-Scene-Text-Detection-with-Box-Annotations
View on GitHub
Unconstrained Text Detection with Box Supervisionand Dynamic Self-Training
☆34Nov 24, 2022Updated 3 years ago
krksgbr / glyphcollector
View on GitHub
☆64Jan 4, 2023Updated 3 years ago
TNishimoto / lzrr
View on GitHub
A new lossless data compression algorithm
☆12Nov 19, 2025Updated 8 months ago
OCR-D / format-converters
View on GitHub
Converters for various file formats used for representing OCR
☆12Apr 30, 2025Updated last year
yoheioka / mighty-scraper
View on GitHub
Template for creating a scraper that saves to Google Sheets, fires Slack notifications, and is scheduled using AWS Lambda and CloudWatch
☆10Dec 27, 2018Updated 7 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
NVlabs / ocropus3
View on GitHub
Repository collecting all the submodules for the new PyTorch-based OCR System.
☆141Feb 22, 2021Updated 5 years ago
ctensmeyer / dibco_2017
View on GitHub
Submission for DIBCO 2017
☆16Sep 11, 2017Updated 8 years ago
Jumpst3r / printed-hw-segmentation
View on GitHub
Printed and handwritten text segmentation using fully convolutional networks and CRF post-processing
☆46Jan 14, 2021Updated 5 years ago
UB-Mannheim / GTCheck
View on GitHub
Check your modified Ground Truth files with visual support!
☆10Jan 31, 2024Updated 2 years ago
syedsaqibbukhari / docanalysis
View on GitHub
☆10Aug 5, 2019Updated 6 years ago
texttechnologylab / textimager-corpus2wiki
View on GitHub
☆11Sep 17, 2021Updated 4 years ago
idhmc-tamu / eMOP
View on GitHub
files and code related to the Early Modern OCR Project (eMOP) at the IDHMC
☆16Oct 2, 2014Updated 11 years ago
stuartemiddleton / glosat_table_dataset
View on GitHub
GloSAT Historical Measurement Table Dataset
☆11Dec 3, 2025Updated 7 months ago
vvuonghn / AI_DocumentLayoutAnalysis
View on GitHub
AI_DocumentLayoutAnalysis
☆39Nov 25, 2020Updated 5 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
UB-Mannheim / blatt
View on GitHub
NLP-helper for OCR-ed pages in PAGE XML format
☆10Dec 6, 2024Updated last year
k-int / gokb-phase1
View on GitHub
Original GOKb repo - Moving to https://github.com/openlibraryenvironment/gokb
☆11Jan 23, 2018Updated 8 years ago
Transkribus / TranskribusBaseLineEvaluationScheme
View on GitHub
☆10Oct 12, 2020Updated 5 years ago
dictcp / jump.sh
View on GitHub
a simple script for ssh to AWS EC2 nodes based on Name Tag and Instance ID, with tab auto-completion
☆11Aug 26, 2020Updated 5 years ago
OCR-D / page-to-alto
View on GitHub
Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)
☆17Jun 5, 2026Updated last month
DIVA-DIA / DIVA-DAF
View on GitHub
Repository for the deep-learning framework DIVA-DAF which is build with historical document image analysis in mind.
☆19Nov 7, 2024Updated last year
Corion / HID-LoupedeckCT
View on GitHub
Perl driver for the Loupedeck CT keyboard
☆13Nov 1, 2025Updated 8 months ago