Python tools for Tesseract OCR training
☆26May 2, 2022Updated 4 years ago
Alternatives and similar repositories for pytesstrain
Users that are interested in pytesstrain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 6 months ago
- Crop And Splice Segments (of scanned pages)☆14Mar 11, 2019Updated 7 years ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- ☆15Jul 11, 2022Updated 3 years ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An extensible viewer for OCR-D mets.xml files☆23May 30, 2024Updated 2 years ago
- An approximate nearest-neighbor search for text reuse.☆12Oct 5, 2020Updated 5 years ago
- NLP-helper for OCR-ed pages in PAGE XML format☆10Dec 6, 2024Updated last year
- NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911)☆18Oct 31, 2025Updated 7 months ago
- ☆28May 26, 2026Updated 2 weeks ago
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…☆16Dec 6, 2025Updated 6 months ago
- Use any vision LLMs to perform OCR using LangChain☆22Jul 29, 2025Updated 10 months ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Oct 18, 2024Updated last year
- Images of example pages from Transkribus model training sets to make it easier to find a match.☆16Jan 25, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Page-wise text recognition with lower-supervision line data models☆53Updated this week
- Knowledge graph construction: Fast inserts into a Wikibase instance☆46Feb 3, 2022Updated 4 years ago
- vertx tcp eventbus client module for python☆12Aug 5, 2016Updated 9 years ago
- Thai Law Dataset (Act of Parliament)☆25Jul 21, 2021Updated 4 years ago
- Kong OAuth SSO Integration☆17Aug 23, 2017Updated 8 years ago
- Enables a pair of phones that have front facing cameras to share text using QR codes.☆14Apr 3, 2022Updated 4 years ago
- Orchestrate web crawlers to create structured datasets from multiple data sources with YAML configs.☆16Dec 8, 2022Updated 3 years ago
- A Python library to add reconstructed pronunciations of Middle Chinese on Chinese texts☆11Mar 13, 2023Updated 3 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Jun 5, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- tesseractXplore a tesseract ease of use gui with full control☆28Nov 10, 2021Updated 4 years ago
- Tesseract tessdata downloader from GitHub repositories☆11Sep 17, 2021Updated 4 years ago
- Next generation OCR engine based on LSTMs.☆51Apr 8, 2018Updated 8 years ago
- Automation of NOAA satellite reception☆14May 14, 2025Updated last year
- Tutorial on how to create metrics dashboards like the THOR Dashboard☆14Mar 8, 2017Updated 9 years ago
- OxGarage is an web, and RESTful, service to manage the transformation of documents between a variety of formats. The majority of transfor…☆53Sep 18, 2015Updated 10 years ago
- An open source reference management tool developed in PyQt5 and Python3.☆13Feb 15, 2026Updated 4 months ago
- Recognition Models for Kraken and CLSTM☆17Aug 21, 2019Updated 6 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆47Mar 31, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Vagrantfile with a Mac OSX host that makes sound work with alsa on Ubuntu/Trusty guest. I'm using this box for the Coursera course Audio …☆11Nov 22, 2015Updated 10 years ago
- Saito --> NEW REPOSITORY -->☆12Dec 31, 2025Updated 5 months ago
- Realtime detection of iris position and blinking.☆10Sep 5, 2020Updated 5 years ago
- Tools and Examples for Computational Text Analysis for Assyriologists.☆11Sep 3, 2018Updated 7 years ago
- A curated list of awesome RDM resources for researchers and organisations☆31Mar 2, 2026Updated 3 months ago
- Data visualization workshop☆11May 12, 2020Updated 6 years ago
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago