wincentbalin / pytesstrain
Python tools for Tesseract OCR training
☆26 · May 2, 2022 · Updated 3 years ago
Alternatives and similar repositories for pytesstrain
Users who are interested in pytesstrain are comparing it to the libraries listed below.
- Convert Transkribus PAGE-XML to standard PAGE-XML ☆12 · Dec 10, 2025 · Updated 2 months ago
- Some bits of javascript to transcribe scanned pages using PageXML ☆17 · Mar 18, 2024 · Updated last year
- ☆20 · Aug 18, 2019 · Updated 6 years ago
- Tutorial on how to create metrics dashboards like the THOR Dashboard ☆14 · Mar 8, 2017 · Updated 8 years ago
- Recognition Models for Kraken and CLSTM ☆16 · Aug 21, 2019 · Updated 6 years ago
- Orchestrate web crawlers to create structured datasets from multiple data sources with YAML configs. ☆15 · Dec 8, 2022 · Updated 3 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces ☆39 · Apr 30, 2025 · Updated 9 months ago
- Layout analysis to find layout elements in documents (similar to P2PaLA) ☆20 · Jan 7, 2026 · Updated last month
- An extensible viewer for OCR-D mets.xml files ☆22 · May 30, 2024 · Updated last year
- OpenCV Python code to read a handwritten word, threshold the characters, draw bounding boxes around it and save the individual letters ☆20 · Sep 24, 2017 · Updated 8 years ago
- Code accompanying our paper "One Knowledge Graph to Rule them All? Analyzing the Differences between DBpedia, YAGO, Wikidata & co." ☆26 · Jul 18, 2017 · Updated 8 years ago
- A Deep Learning based Speller ☆28 · Jan 21, 2019 · Updated 7 years ago
- OxGarage is a web, and RESTful, service to manage the transformation of documents between a variety of formats. The majority of transfor… ☆53 · Sep 18, 2015 · Updated 10 years ago
- A text parser. ☆31 · Apr 16, 2022 · Updated 3 years ago
- python library ☆12 · Nov 25, 2025 · Updated 2 months ago
- Text Re-use Alignment Visualization ☆38 · Nov 8, 2017 · Updated 8 years ago
- Transcriptions of primers (19th century) ☆11 · Oct 31, 2025 · Updated 3 months ago
- Input pipelines for large scale, sharded training of deep learning models. ☆40 · Jun 18, 2019 · Updated 6 years ago
- Automation of NOAA satellite reception ☆13 · May 14, 2025 · Updated 8 months ago
- Data visualization workshop ☆11 · May 12, 2020 · Updated 5 years ago
- ☆22 · Dec 15, 2025 · Updated last month
- Super simple, zero config options, <2kb declarative tooltip library with no dependencies. ☆17 · Jun 2, 2023 · Updated 2 years ago
- My OpenCode and Oh-My-OpenCode configuration files with API proxy setup documentation ☆28 · Jan 5, 2026 · Updated last month
- An open-source command-line tool for developing Unity games with Claude Code ☆37 · Jan 23, 2026 · Updated 3 weeks ago
- The platform behind nous le peuple. Fork of Pligg. ☆10 · Sep 29, 2015 · Updated 10 years ago
- A collection of OCR'd and machine-corrected Greek texts. This base repository contains Git submodules for the different works and an inve… ☆11 · Nov 18, 2014 · Updated 11 years ago
- NLP-helper for OCR-ed pages in PAGE XML format ☆10 · Dec 6, 2024 · Updated last year
- Find the path of the current .ipynb file. ☆11 · Oct 21, 2022 · Updated 3 years ago
- Faster access to Tesseract-OCR from Python ☆13 · Jun 8, 2021 · Updated 4 years ago
- Two-Step Approach to OCR Post-Correction ☆14 · May 24, 2024 · Updated last year
- BioMixer ☆17 · May 18, 2016 · Updated 9 years ago
- A Persian Word2Vec Model trained by Wikipedia articles ☆10 · Jan 5, 2018 · Updated 8 years ago
- Convert ALTO XML to plain text + minimal metadata ☆17 · Oct 17, 2024 · Updated last year
- Binary of pullcontainer ☆10 · Dec 12, 2014 · Updated 11 years ago
- ☆10 · Oct 15, 2020 · Updated 5 years ago
- Format specifiers to use with json-schema-validator ☆12 · Jul 1, 2013 · Updated 12 years ago
- Repository for DCIS segmentation pipeline ☆11 · Nov 22, 2022 · Updated 3 years ago
- WIP. A directed graph editor with React, Redux and D3.js ☆11 · Oct 3, 2017 · Updated 8 years ago
- Implementation of freedesktop.org specifications. ☆16 · Sep 5, 2016 · Updated 9 years ago