tesseract-ocr / tessdata_fastLinks
Fast integer versions of trained LSTM models
☆586Updated last year
Alternatives and similar repositories for tessdata_fast
Users that are interested in tessdata_fast are comparing it to the libraries listed below
Sorting:
- Best (most accurate) trained LSTM models.☆1,481Updated last year
- Source training data for Tesseract for lots of languages☆863Updated 9 months ago
- Box editor and trainer for Tesseract OCR☆249Updated 3 weeks ago
- Train Tesseract LSTM with make☆708Updated 8 months ago
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆2,003Updated this week
- ☆441Updated 11 years ago
- Data used for LSTM model training☆125Updated last year
- Various documents related to Tesseract OCR☆267Updated 4 years ago
- Tesseract documentation☆2,258Updated 3 weeks ago
- Trained models with fast variant of the "best" LSTM models + legacy models☆7,333Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆405Updated last year
- Line based ATR Engine based on OCRopy☆1,177Updated 7 months ago
- OCR engine for all the languages☆927Updated 2 weeks ago
- Java GUI frontend for Tesseract OCR engine☆69Updated 3 weeks ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts☆79Updated 3 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆110Updated 2 years ago
- 📰 Binary distribution of PDFium☆1,234Updated this week
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,928Updated last year
- Download Poppler binaries packaged for Windows with dependencies☆1,067Updated 3 weeks ago
- ABBYY Cloud OCR SDK☆528Updated 2 years ago
- Tesseract Open Source OCR Engine (main repository)☆3,978Updated 2 months ago
- Demos, examples and utilities using PyMuPDF☆693Updated last year
- A Python wrapper for the tesseract-ocr API☆2,138Updated 2 weeks ago
- finetuned traineddata files for tesseract 4.0.0 for testing☆169Updated 6 years ago
- Convenience Docker images for Apache Tika Server☆227Updated 3 months ago
- Library used to deskew a scanned document☆495Updated this week
- Extract tables from scanned image PDFs using Optical Character Recognition.☆276Updated 5 years ago
- ☆1,751Updated 5 years ago
- Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder☆89Updated 3 months ago
- Convert a PDF via OCR to a TXT file in UTF-8 encoding☆155Updated 2 years ago