tesseract-ocr / langdataLinks
Source training data for Tesseract for lots of languages
☆860Updated 7 months ago
Alternatives and similar repositories for langdata
Users that are interested in langdata are comparing it to the libraries listed below
Sorting:
- Various documents related to Tesseract OCR☆267Updated 4 years ago
- Best (most accurate) trained LSTM models.☆1,460Updated last year
- Fast integer versions of trained LSTM models☆579Updated last year
- Trained models with fast variant of the "best" LSTM models + legacy models☆7,250Updated last year
- Box editor and trainer for Tesseract OCR☆247Updated 4 months ago
- Train Tesseract LSTM with make☆704Updated 7 months ago
- ABBYY Cloud OCR SDK☆525Updated 2 years ago
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆1,973Updated 3 weeks ago
- Python-based tools for document analysis and OCR☆3,467Updated 4 years ago
- Line based ATR Engine based on OCRopy☆1,170Updated 6 months ago
- A Python wrapper for the tesseract-ocr API☆2,129Updated last month
- A curated list of promising OCR resources☆1,693Updated 3 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆403Updated last year
- Data used for LSTM model training☆123Updated last year
- OCR engine for all the languages☆910Updated last week
- A simple python OCR engine using opencv☆529Updated last year
- Tesseract documentation☆2,214Updated 2 months ago
- A small C++ implementation of LSTM networks, focused on OCR.☆829Updated 6 years ago
- A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab☆929Updated 7 years ago
- make a better chinese character recognition OCR than tesseract☆1,514Updated 8 years ago
- A simple program to extract the text from an image before performing OCR☆222Updated 5 years ago
- Tesseract 4 OCR Compilation - Docker Container☆56Updated 3 years ago
- Links to awesome OCR projects☆3,064Updated last year
- Java GUI and Tools for Tesseract OCR☆336Updated last year
- A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cl…☆1,083Updated 2 years ago
- Detecting Text in Natural Image with Connectionist Text Proposal Network (ECCV'16)☆1,286Updated 4 years ago
- Inspired by Machine Learning course on coursera.org. A helper tool for generating ocr features for Machine Learning algos...☆77Updated 5 years ago
- Extract tables from scanned image PDFs using Optical Character Recognition.☆275Updated 5 years ago
- Basic Optical Character Recognition Tutorial. Damiles Blog.☆121Updated 4 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆110Updated 2 years ago