Shreeshrii / tess5train-fontsView external linksLinks
Files and Scripts to run Tesseract 5 LSTM Training using fonts
☆79Feb 6, 2022Updated 4 years ago
Alternatives and similar repositories for tess5train-fonts
Users that are interested in tess5train-fonts are comparing it to the libraries listed below
Sorting:
- Train Tesseract LSTM with make☆714Apr 18, 2025Updated 9 months ago
- ☆16Mar 24, 2021Updated 4 years ago
- Finetuned traineddata files for Arabic☆31Feb 28, 2019Updated 6 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 5 years ago
- Master repository which includes most other OCR-D repositories as submodules☆72Jul 4, 2025Updated 7 months ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆194Nov 16, 2025Updated 3 months ago
- DFKI Layout Detection for OCR-D☆47May 1, 2025Updated 9 months ago
- ☆28Oct 5, 2022Updated 3 years ago
- Converters for various file formats used for representing OCR☆12Apr 30, 2025Updated 9 months ago
- ☆15Jun 22, 2020Updated 5 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆14Jan 20, 2026Updated 3 weeks ago
- Pretrained mixed models to be used with Calamari.☆69Oct 1, 2024Updated last year
- Quicksign OCRized Text Dataset (QS-OCR)☆45May 7, 2019Updated 6 years ago
- Bu Uyghur yéziqini Pythonning gensim ambiridiki word2vec algorizimida sinap baqqan misal.☆16Dec 1, 2021Updated 4 years ago
- An extensible viewer for OCR-D mets.xml files☆22May 30, 2024Updated last year
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR).☆26Dec 24, 2014Updated 11 years ago
- Layout analysis to find layout elements in documents (similar to P2PaLA)☆20Jan 7, 2026Updated last month
- ☆22Oct 23, 2015Updated 10 years ago
- ☆18Jan 14, 2021Updated 5 years ago
- ☆21Jul 24, 2019Updated 6 years ago
- Receipt.ID is a multi-label, multi-class, hierarchical classification system implemented in a two layer feed forward network.☆22Nov 27, 2017Updated 8 years ago
- DHLAB is a library of python modules for accessing text and pictures at the National Library of Norway.☆24Oct 13, 2025Updated 4 months ago
- Tesseract-ocr for Thai language☆23Feb 23, 2018Updated 7 years ago
- ☆10Jul 29, 2025Updated 6 months ago
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support☆59Apr 16, 2021Updated 4 years ago
- ☆28Jul 17, 2019Updated 6 years ago
- Line based ATR Engine based on OCRopy☆1,185May 12, 2025Updated 9 months ago
- Python 3 library for processing historical English☆68Aug 10, 2024Updated last year
- Post-processing OCR errors with seq2seq models☆28Jul 30, 2020Updated 5 years ago
- Fast integer versions of trained LSTM models☆595Aug 1, 2024Updated last year
- Detecting Radiological Threats in Urban Areas (9th place solution)☆10May 4, 2019Updated 6 years ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 3 months ago
- Template and steps to build your personal blog using Jekyll and Minimal Mistake☆10Feb 24, 2020Updated 5 years ago
- The Framework for Optimization of Resources, Controls, and Economics is a collection of software tools, models, and datasets acquired and…☆11Jan 2, 2025Updated last year
- Glyph Miner, a system for extracting glyphs from early typeset prints☆34Sep 29, 2016Updated 9 years ago
- Conversions between various OCR formats☆82Updated this week
- Spell correction language model for Uyghur language based on transformer neural network☆14Jun 18, 2025Updated 7 months ago
- hmm-filter: Improve classifier predictions for sequential data with Hidden Markov Models (HMMs)☆12Jan 23, 2019Updated 7 years ago
- An example Vue project displaying a PDF viewer☆11Jan 6, 2023Updated 3 years ago