Clone of https://gitlab.com/scripta/escriptorium.git with updates from UB Mannheim
☆37Apr 24, 2026Updated last week
Alternatives and similar repositories for escriptorium
Users that are interested in escriptorium are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Jul 2, 2021Updated 4 years ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 6 months ago
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Apr 27, 2020Updated 6 years ago
- Page-wise text recognition with lower-supervision line data models☆53Mar 30, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Oct 18, 2024Updated last year
- Named Entity Recognition☆19Feb 13, 2026Updated 2 months ago
- Converters for various file formats used for representing OCR☆12Apr 30, 2025Updated last year
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Dec 17, 2021Updated 4 years ago
- 'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc. The final training process for lat.traineddata☆13Jan 13, 2016Updated 10 years ago
- OCR-D post-correction with encoder-attention-decoder LSTMs☆13May 1, 2025Updated last year
- ☆14Jul 11, 2022Updated 3 years ago
- A repository for online OCRD training infrastructure.☆13Aug 20, 2020Updated 5 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆17Jan 20, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as wel…☆24Jan 30, 2021Updated 5 years ago
- Layout analysis to find layout elements in documents (similar to P2PaLA)☆20Mar 24, 2026Updated last month
- xslt template documentation generator☆10Apr 19, 2018Updated 8 years ago
- NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911)☆18Oct 31, 2025Updated 6 months ago
- ☆28Dec 10, 2025Updated 4 months ago
- A curated list of awesome RDM resources for researchers and organisations☆31Mar 2, 2026Updated 2 months ago
- Python tools for performing various operations on ALTO XML files☆49Feb 27, 2025Updated last year
- An extensible viewer for OCR-D mets.xml files☆23May 30, 2024Updated last year
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆60Mar 20, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Apr 29, 2026Updated last week
- The official Music Performance Markup graphical editor and analysis tool☆10Apr 22, 2026Updated 2 weeks ago
- ☆24Apr 8, 2026Updated 3 weeks ago
- Arabic handwriting dataset and starter code for deep learning study group☆15Oct 9, 2017Updated 8 years ago
- ALTO XML schema - latest and all former versions☆55Jan 20, 2026Updated 3 months ago
- Mannheim library utilities☆27Dec 29, 2025Updated 4 months ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Jan 13, 2024Updated 2 years ago
- An OCR evaluation tool☆70Aug 22, 2025Updated 8 months ago
- The CIS OCR PostCorrectionTool☆44Nov 7, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆96Feb 5, 2026Updated 3 months ago
- Process, enhance and evaluate multiple OCR output.☆24Dec 2, 2025Updated 5 months ago
- ☆10Aug 5, 2019Updated 6 years ago
- Web application that powers weber-gesamtausgabe.de☆24Updated this week
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 4 months ago
- Power Query M Functions to Access United States Census Bureau API☆13Nov 6, 2020Updated 5 years ago
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago