Python tools for Tesseract OCR training
☆26May 2, 2022Updated 4 years ago
Alternatives and similar repositories for pytesstrain
Users that are interested in pytesstrain are comparing it to the libraries listed below.
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 5 months ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Crop And Splice Segments (of scanned pages)☆14Mar 11, 2019Updated 7 years ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- Some bits of javascript to transcribe scanned pages using PageXML☆17Mar 18, 2024Updated 2 years ago
- ☆15Jul 11, 2022Updated 3 years ago
- Transcriptions of primers (19th century)☆11Oct 31, 2025Updated 6 months ago
- An extensible viewer for OCR-D mets.xml files☆23May 30, 2024Updated last year
- NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911)☆18Oct 31, 2025Updated 6 months ago
- Shan Natural Language Processing tools inspired by PythaiNLP☆14Mar 1, 2026Updated 2 months ago
- OCR & Ground Truth Resources☆78May 3, 2022Updated 4 years ago
- Cosine Similarity Search in ElasticSearch + FAISS GPU☆12Mar 24, 2022Updated 4 years ago
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…☆15Dec 6, 2025Updated 5 months ago
- Use any vision LLMs to perform OCR using LangChain☆22Jul 29, 2025Updated 9 months ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Oct 18, 2024Updated last year
- Images of example pages from Transkribus model training sets to make it easier to find a match.☆16Jan 25, 2022Updated 4 years ago
- Speech Emotion Recognition using PyTorch sponsored by AIS and VISTEC-DEPA AIResearch Institute Thailand.☆22Nov 6, 2021Updated 4 years ago
- Binary of pullcontainer☆10Dec 12, 2014Updated 11 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Dec 17, 2021Updated 4 years ago
- Single source publishing for vertical writing☆11Mar 15, 2021Updated 5 years ago
- Page-wise text recognition with lower-supervision line data models☆53May 19, 2026Updated last week
- Thai Law Dataset (Act of Parliament)☆24Jul 21, 2021Updated 4 years ago
- Kong OAuth SSO Integration☆16Aug 23, 2017Updated 8 years ago
- presentations for busy messy hackers☆36Jan 21, 2014Updated 12 years ago
- Orchestrate web crawlers to create structured datasets from multiple data sources with YAML configs.☆16Dec 8, 2022Updated 3 years ago
- Python tools for performing various operations on ALTO XML files☆49Feb 27, 2025Updated last year
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39May 5, 2026Updated 3 weeks ago
- Tesseract tessdata downloader from GitHub repositories☆11Sep 17, 2021Updated 4 years ago
- Tutorial on how to create metrics dashboards like the THOR Dashboard☆14Mar 8, 2017Updated 9 years ago
- Recognition Models for Kraken and CLSTM☆17Aug 21, 2019Updated 6 years ago
- ☆11Aug 8, 2016Updated 9 years ago
- Node.js vnc client with gui☆19May 1, 2017Updated 9 years ago
- A curated list of awesome RDM resources for researchers and organisations☆31Mar 2, 2026Updated 2 months ago
- Evaluation of STT models for the German language☆15Jan 22, 2022Updated 4 years ago
- K-RET: Knowledgeable Biomedical Relation Extraction System☆10Feb 22, 2025Updated last year
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support☆60Apr 16, 2021Updated 5 years ago
- An airgapped encryption/decryption device for off-grid communication☆10Aug 16, 2023Updated 2 years ago
- Train Tesseract LSTM with tesstrain.sh on Windows☆26Dec 24, 2023Updated 2 years ago
- Anomaly detection in time-series networks. Spatio-temporal Anomaly Detection☆12Jan 9, 2020Updated 6 years ago