Document Understanding tools
☆21Dec 22, 2021Updated 4 years ago
Alternatives and similar repositories for TranskribusDU
Users that are interested in TranskribusDU are comparing it to the libraries listed below
Sorting:
- A Pythonic API and some command line tools to access the Transkribus server via its REST API☆28Nov 25, 2022Updated 3 years ago
- Some bits of javascript to transcribe scanned pages using PageXML☆17Mar 18, 2024Updated last year
- OCR-D post-correction module based on weighted finite-state transducers☆11Jan 13, 2024Updated 2 years ago
- Simple app for visual editing of Page XML files☆31Sep 25, 2025Updated 5 months ago
- DatasetImgLabeler is a image annotation tool for researchers to prepare datasets in ICDAR2015 format☆12Dec 7, 2019Updated 6 years ago
- An OCR evaluation tool☆69Aug 22, 2025Updated 6 months ago
- PAGE XML format collection for document image page content and more☆70Jan 16, 2026Updated last month
- Augment line images for improving OCR datasets☆10Oct 4, 2023Updated 2 years ago
- ☆23Oct 18, 2024Updated last year
- DFKI Layout Detection for OCR-D☆47May 1, 2025Updated 10 months ago
- ☆10Aug 5, 2019Updated 6 years ago
- ☆10May 24, 2019Updated 6 years ago
- ☆10Mar 16, 2023Updated 2 years ago
- Computer Vision and Deep Learning tutorials for the course Foundation of Digital Humanities☆10Dec 6, 2019Updated 6 years ago
- ☆28Jul 17, 2019Updated 6 years ago
- Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)☆17Sep 18, 2025Updated 5 months ago
- Post-processing OCR errors with seq2seq models☆28Jul 30, 2020Updated 5 years ago
- A repository for online OCRD training infrastructure.☆13Aug 20, 2020Updated 5 years ago
- fixed some errors from AirBernard/Scene-Text-Detection-with-SPCNET☆13Jul 29, 2019Updated 6 years ago
- Text Detection using Stroke Width Transform☆12Sep 13, 2014Updated 11 years ago
- A Dense Text Detection model using Receptive Field Blocks☆32Nov 21, 2022Updated 3 years ago
- guides and test data for OCR4all☆32Oct 4, 2022Updated 3 years ago
- Tools for ICDAR2019 competitions(fifth place)☆11May 6, 2019Updated 6 years ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35May 25, 2023Updated 2 years ago
- Library in C++ and a python wrapper for dealing with Page XML files☆13Apr 25, 2025Updated 10 months ago
- ☆16Jan 19, 2023Updated 3 years ago
- Double-checked Gold Standard Data for Training and Testing OCR Engines☆21Dec 31, 2022Updated 3 years ago
- Note: the repo has been moved to https://gitlab.com/readcoop/Transkribus/TranskribusSwtGui☆18Oct 27, 2020Updated 5 years ago
- Process, enhance and evaluate multiple OCR output.☆24Dec 2, 2025Updated 3 months ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆195Updated this week
- tesseractXplore a tesseract ease of use gui with full control☆28Nov 10, 2021Updated 4 years ago
- A curated list of amazingly libraries, services and resources to work with PDF files☆16Jan 28, 2026Updated last month
- Code for my ICDAR paper "Deep Visual Template-Free Form Parsing"☆89Jan 14, 2022Updated 4 years ago
- Page to PAGE Layout Analysis Tool☆191Jan 17, 2022Updated 4 years ago
- QA-tool for scans with corresponding ALTO-files☆26Dec 2, 2022Updated 3 years ago
- ☆66Feb 3, 2026Updated last month
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆59Sep 25, 2025Updated 5 months ago
- ☆24Dec 8, 2022Updated 3 years ago
- An OpenSeadragon plugin that adds SVG overlay capability.☆62Aug 9, 2024Updated last year