tesseract-ocr / langdataLinks
Source training data for Tesseract for lots of languages
☆859Updated 4 months ago
Alternatives and similar repositories for langdata
Users that are interested in langdata are comparing it to the libraries listed below
Sorting:
- Best (most accurate) trained LSTM models.☆1,406Updated last year
- Various documents related to Tesseract OCR☆266Updated 3 years ago
- Fast integer versions of trained LSTM models☆563Updated last year
- Train Tesseract LSTM with make☆690Updated 4 months ago
- Box editor and trainer for Tesseract OCR☆245Updated 2 months ago
- Line based ATR Engine based on OCRopy☆1,157Updated 3 months ago
- A simple python OCR engine using opencv☆531Updated last year
- Python-based tools for document analysis and OCR☆3,462Updated 4 years ago
- A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab☆929Updated 7 years ago
- A Python wrapper for the tesseract-ocr API☆2,113Updated 3 weeks ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆395Updated last year
- Data used for LSTM model training☆119Updated last year
- A curated list of promising OCR resources☆1,692Updated 3 years ago
- Tesseract documentation☆2,148Updated 2 weeks ago
- Links to awesome OCR projects☆3,036Updated last year
- OCR engine for all the languages☆867Updated last week
- Java GUI and Tools for Tesseract OCR☆333Updated last year
- Tesseract 4 OCR Compilation - Docker Container☆54Updated 3 years ago
- finetuned traineddata files for tesseract 4.0.0 for testing☆168Updated 6 years ago
- A Python wrapper for Google Tesseract☆6,204Updated 2 weeks ago
- Real-time image preprocess and OCR.☆274Updated 3 years ago
- make a better chinese character recognition OCR than tesseract☆1,515Updated 7 years ago
- A scientific document recognition system☆171Updated 2 years ago
- Detect and fix skew in images containing text☆267Updated 6 years ago
- OCR evaluation brought to you by University of Alicante☆68Updated 2 years ago
- A simple program to extract the text from an image before performing OCR☆222Updated 5 years ago
- Extract tables from scanned image PDFs using Optical Character Recognition.☆275Updated 5 years ago
- A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cl…☆1,081Updated last year
- Repository for tesseract testing☆34Updated last year
- Inspired by Machine Learning course on coursera.org. A helper tool for generating ocr features for Machine Learning algos...☆77Updated 5 years ago