tesseract-ocr / langdata
Source training data for Tesseract for lots of languages
☆859 Updated 5 months ago
Alternatives and similar repositories for langdata
Users who are interested in langdata compare it to the repositories listed below:
- Best (most accurate) trained LSTM models. ☆1,415 Updated last year
- Various documents related to Tesseract OCR ☆266 Updated 4 years ago
- Fast integer versions of trained LSTM models ☆564 Updated last year
- Box editor and trainer for Tesseract OCR ☆246 Updated 2 months ago
- Trained models with fast variant of the "best" LSTM models + legacy models ☆7,140 Updated last year
- Tesseract documentation ☆75 Updated 4 years ago
- ABBYY Cloud OCR SDK ☆522 Updated 2 years ago
- Train Tesseract LSTM with make ☆696 Updated 5 months ago
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The … ☆1,958 Updated last month
- Python-based tools for document analysis and OCR ☆3,464 Updated 4 years ago
- Data used for LSTM model training ☆121 Updated last year
- Line based ATR Engine based on OCRopy ☆1,162 Updated 4 months ago
- Tesseract documentation ☆2,160 Updated last month
- OCR engine for all the languages ☆879 Updated 2 weeks ago
- Java GUI and Tools for Tesseract OCR ☆334 Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆396 Updated last year
- Links to awesome OCR projects ☆3,041 Updated last year
- finetuned traineddata files for tesseract 4.0.0 for testing ☆168 Updated 6 years ago
- A curated list of promising OCR resources ☆1,692 Updated 3 years ago
- make a better chinese character recognition OCR than tesseract ☆1,514 Updated 7 years ago
- Tesseract 4 OCR Compilation - Docker Container ☆55 Updated 3 years ago
- charlesw/tesseract 4.0 build for x64 Windows using C++ run-time 141. ☆61 Updated 7 years ago
- Real-time image preprocess and OCR. ☆274 Updated 3 years ago
- A simple python OCR engine using opencv ☆531 Updated last year
- A scientific document recognition system ☆171 Updated 2 years ago
- Tesseract Open Source OCR Engine (main repository) ☆3,764 Updated 2 months ago
- A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cl… ☆1,081 Updated last year
- Files and Scripts to run Tesseract 5 LSTM Training using fonts ☆79 Updated 3 years ago
- Inspired by Machine Learning course on coursera.org. A helper tool for generating ocr features for Machine Learning algos... ☆77 Updated 5 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/… ☆132 Updated 2 years ago