Various documents related to Tesseract OCR
☆266Sep 12, 2021Updated 4 years ago
Alternatives and similar repositories for docs
Users that are interested in docs are comparing it to the libraries listed below
Sorting:
- Source training data for Tesseract for lots of languages☆863Apr 1, 2025Updated 11 months ago
- l read the classic papers writted by Ray Smith.During reading , l made some notes in Chinese .From now , l have known lots of information…☆31Jan 19, 2018Updated 8 years ago
- Python-based tools for document analysis and OCR☆3,472May 22, 2021Updated 4 years ago
- Tesseract Open Source OCR Engine (main repository)☆72,562Feb 21, 2026Updated last week
- A small C++ implementation of LSTM networks, focused on OCR.☆830Oct 24, 2019Updated 6 years ago
- Trained models with fast variant of the "best" LSTM models + legacy models☆7,398Mar 9, 2024Updated last year
- Part of eMOP: Franken+ tool for creating font training for Tesseract OCR engine from page images.☆24Sep 24, 2015Updated 10 years ago
- 🖺 OCR using tensorflow with attention☆645Sep 5, 2019Updated 6 years ago
- A tools can generate samples for OCR trainning. 用于OCR的字符样本生成工具☆65Oct 22, 2017Updated 8 years ago
- A library and command-line tool for fetching Facebook Pages' published posts.☆13Jul 18, 2017Updated 8 years ago
- ☆15Sep 30, 2023Updated 2 years ago
- Links to awesome OCR projects☆3,088Jul 6, 2024Updated last year
- Just a command line interface to StackOverflow☆22Jan 3, 2012Updated 14 years ago
- Grab nonprofit tax information from the ProPublica API and put it in a Google spreadsheet!☆14Jun 2, 2017Updated 8 years ago
- Base model for entities in our Blender Laravel template☆13Apr 1, 2019Updated 6 years ago
- A Python wrapper for the tesseract-ocr API☆2,155Jan 13, 2026Updated last month
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆2,026Feb 4, 2026Updated 3 weeks ago
- The extra Zookeeper support for Illyria client pool.☆11Jul 3, 2024Updated last year
- Automatically detect paper-based form fields.☆13May 15, 2018Updated 7 years ago
- ☆17Mar 8, 2018Updated 7 years ago
- An easy-to-use, scriptable, command-line interface to JMX servers based on Java/Tcl. Released under the Apache License 2.0.☆15Aug 31, 2016Updated 9 years ago
- Edit distance library for Haskell☆27Jan 19, 2017Updated 9 years ago
- Officially recognized OIDs used in issuance of DigiCert certificates☆17Jan 14, 2026Updated last month
- A collection of Django extensions that add content-management facilities to Django projects.☆41May 7, 2015Updated 10 years ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts☆79Feb 6, 2022Updated 4 years ago
- Investigative tool for extracting relevant areas from many documents☆14Nov 17, 2015Updated 10 years ago
- ☆15Jun 22, 2020Updated 5 years ago
- ☆18Sep 3, 2017Updated 8 years ago
- Library with user interface elements and client-server communication classes based on Google Web Toolkit (GWT) that can be used for crowd…☆14Oct 3, 2017Updated 8 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆195Updated this week
- Best (most accurate) trained LSTM models.☆1,506Mar 9, 2024Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆408Aug 10, 2024Updated last year
- ☆16Aug 8, 2017Updated 8 years ago
- The useful and used parts of NN-Dropout☆25Jun 4, 2015Updated 10 years ago
- Minimizes Django templates so that html is served up already minimized. Minimizes django templates and the html, in-line javascript, and…☆27Dec 6, 2015Updated 10 years ago
- (BROKEN, help wanted)☆15Mar 25, 2016Updated 9 years ago
- Tornado Demo Vulnerable Application to test SQL injection vulnerability and patch it using RASP (Runtime Application Self-Protection)☆11Nov 15, 2017Updated 8 years ago
- A fast data loader for ImageNet on PyTorch.☆18Mar 17, 2019Updated 6 years ago
- Free open-source OCR application for the Windows Desktop - A modern GUI front-end for the Tesseract OCR engine. The application also incl…☆267Apr 11, 2015Updated 10 years ago