tesseract-ocr / langdata
Source training data for Tesseract for lots of languages
☆850 Updated last year
Alternatives and similar repositories for langdata:
Users who are interested in langdata are comparing it to the libraries listed below
- Fast integer versions of trained LSTM models ☆523 Updated 7 months ago
- Best (most accurate) trained LSTM models. ☆1,319 Updated last year
- Various documents related to Tesseract OCR ☆265 Updated 3 years ago
- Trained models with fast variant of the "best" LSTM models + legacy models ☆6,785 Updated last year
- Train Tesseract LSTM with make ☆660 Updated 9 months ago
- A Python wrapper for the tesseract-ocr API ☆2,079 Updated last month
- ABBYY Cloud OCR SDK ☆513 Updated last year
- Box editor and trainer for Tesseract OCR ☆237 Updated 8 months ago
- A curated list of promising OCR resources ☆1,676 Updated 2 years ago
- A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab ☆929 Updated 6 years ago
- Tesseract documentation ☆1,988 Updated last month
- A Python wrapper for Google Tesseract ☆6,055 Updated last month
- OCR engine for all the languages ☆796 Updated this week
- Line based ATR Engine based on OCRopy ☆1,126 Updated 2 weeks ago
- 🖺 OCR using tensorflow with attention ☆647 Updated 5 years ago
- Data used for LSTM model training ☆116 Updated last year
- Tesseract 4 OCR Compilation - Docker Container ☆54 Updated 2 years ago
- A small C++ implementation of LSTM networks, focused on OCR. ☆825 Updated 5 years ago
- make a better chinese character recognition OCR than tesseract ☆1,512 Updated 7 years ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts ☆80 Updated 3 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆389 Updated 7 months ago
- A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cl… ☆1,081 Updated last year
- OCR evaluation brought to you by University of Alicante ☆67 Updated 2 years ago
- Library used to deskew a scanned document ☆447 Updated this week
- Document Image Dewarping ☆362 Updated 5 years ago
- Extract tables from scanned image PDFs using Optical Character Recognition. ☆271 Updated 4 years ago
- Detecting Text in Natural Image with Connectionist Text Proposal Network (ECCV'16) ☆1,285 Updated 3 years ago
- Document Layout Analysis ☆361 Updated this week
- ID card recognition OCR ☆479 Updated last year
- finetuned traineddata files for tesseract 4.0.0 for testing ☆163 Updated 5 years ago