Train Tesseract LSTM with make
☆714Apr 18, 2025Updated 10 months ago
Alternatives and similar repositories for tesstrain
Users that are interested in tesstrain are comparing it to the libraries listed below
Sorting:
- Files and Scripts to run Tesseract 5 LSTM Training using fonts☆79Feb 6, 2022Updated 4 years ago
- Dockerized example to train Tesseract v. 4☆63Dec 8, 2022Updated 3 years ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 4 months ago
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- ALTO XML schema - latest and all former versions☆55Jan 20, 2026Updated last month
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆23Feb 21, 2018Updated 8 years ago
- User contributed (non Google) OCR models for Tesseract☆30Apr 18, 2025Updated 10 months ago
- Next generation OCR engine based on LSTMs.☆51Apr 8, 2018Updated 7 years ago
- OCR & Ground Truth Resources☆78May 3, 2022Updated 3 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Oct 18, 2024Updated last year
- OCR engine for all the languages☆956Feb 25, 2026Updated last week
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Apr 30, 2025Updated 10 months ago
- Wrapper for the kraken OCR engine☆12Jul 12, 2025Updated 7 months ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆195Updated this week
- OCR-D python tools☆33Aug 16, 2024Updated last year
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support☆60Apr 16, 2021Updated 4 years ago
- An extensible viewer for OCR-D mets.xml files☆23May 30, 2024Updated last year
- Fast integer versions of trained LSTM models☆595Aug 1, 2024Updated last year
- Line based ATR Engine based on OCRopy☆1,184May 12, 2025Updated 9 months ago
- Python tools for Tesseract OCR training☆26May 2, 2022Updated 3 years ago
- An OCR evaluation tool☆69Aug 22, 2025Updated 6 months ago
- Training files for Greek cursive script (in early print)☆15May 26, 2021Updated 4 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆15Jan 20, 2026Updated last month
- Collection of OCR-related python tools and wrappers from @OCR-D☆133Feb 4, 2026Updated last month
- Train Tesseract LSTM with make on Windows☆10Dec 24, 2023Updated 2 years ago
- A synthetic data generator for text recognition☆3,646Jul 18, 2024Updated last year
- Tesseract Config files☆32Sep 12, 2021Updated 4 years ago
- Tesseract documentation☆2,303Feb 23, 2026Updated last week
- Recognize text using Calamari OCR and the OCR-D framework☆15May 13, 2025Updated 9 months ago
- Docker container for ocropus3 OCR system☆12Aug 19, 2018Updated 7 years ago
- Ergonomic line-by-line transcription of scanned text.☆54Feb 2, 2026Updated last month
- Tesseract Open Source OCR Engine (main repository)☆72,688Updated this week
- Trained models with fast variant of the "best" LSTM models + legacy models☆7,411Mar 9, 2024Updated last year
- Intuitive interface for fine-tuning and retraining a Tesseract OCR language model☆10Jul 4, 2025Updated 8 months ago
- Conversions between various OCR formats☆83Feb 13, 2026Updated 3 weeks ago
- Process, enhance and evaluate multiple OCR output.☆24Dec 2, 2025Updated 3 months ago
- Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho…☆22Sep 2, 2022Updated 3 years ago
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆2,028Feb 28, 2026Updated last week