tesseract-ocr / langdata
Source training data for Tesseract for lots of languages
☆856 · Updated last month
Alternatives and similar repositories for langdata
Users who are interested in langdata are comparing it to the repositories listed below
- Various documents related to Tesseract OCR ☆265 · Updated 3 years ago
- Best (most accurate) trained LSTM models. ☆1,337 · Updated last year
- Train Tesseract LSTM with make ☆674 · Updated 3 weeks ago
- Fast integer versions of trained LSTM models ☆534 · Updated 9 months ago
- Trained models with fast variant of the "best" LSTM models + legacy models ☆6,904 · Updated last year
- Tesseract documentation ☆75 · Updated 3 years ago
- Data used for LSTM model training ☆117 · Updated last year
- Tesseract documentation ☆2,029 · Updated 3 months ago
- OCR engine for all the languages ☆826 · Updated this week
- Box editor and trainer for Tesseract OCR ☆240 · Updated 10 months ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆391 · Updated 9 months ago
- A Python wrapper for the tesseract-ocr API ☆2,094 · Updated this week
- Links to awesome OCR projects ☆2,970 · Updated 10 months ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object ☆1,779 · Updated 9 months ago
- Line based ATR Engine based on OCRopy ☆1,134 · Updated 3 weeks ago
- ABBYY Cloud OCR SDK ☆514 · Updated last year
- A small C++ implementation of LSTM networks, focused on OCR. ☆825 · Updated 5 years ago
- A Python wrapper for Google Tesseract ☆6,093 · Updated last month
- Python-based tools for document analysis and OCR ☆3,449 · Updated 3 years ago
- ☆422 · Updated 10 years ago
- A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cl… ☆1,082 · Updated last year
- An implementation of RESTful web service for tesseract-OCR using tornado ☆136 · Updated last year
- Files and Scripts to run Tesseract 5 LSTM Training using fonts ☆80 · Updated 3 years ago
- A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab ☆928 · Updated 6 years ago
- 🖺 OCR using tensorflow with attention ☆647 · Updated 5 years ago
- Document Layout Analysis resources repos for development with PdfPig. ☆612 · Updated last year
- Scripts and results from our OCR roundup, available on Source ☆150 · Updated 6 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆185 · Updated 5 months ago
- A curated list of promising OCR resources ☆1,681 · Updated 2 years ago
- Tesseract 4 OCR Compilation - Docker Container ☆54 · Updated 3 years ago