tesseract-ocr / tessdocLinks
Tesseract documentation
☆2,253Updated 3 weeks ago
Alternatives and similar repositories for tessdoc
Users that are interested in tessdoc are comparing it to the libraries listed below
Sorting:
- Best (most accurate) trained LSTM models.☆1,476Updated last year
- Tesseract Open Source OCR Engine (main repository)☆3,959Updated 2 months ago
- Fast integer versions of trained LSTM models☆585Updated last year
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆1,995Updated this week
- Train Tesseract LSTM with make☆707Updated 8 months ago
- Trained models with fast variant of the "best" LSTM models + legacy models☆7,319Updated last year
- A Python wrapper for Google Tesseract☆6,288Updated this week
- Source training data for Tesseract for lots of languages☆863Updated 8 months ago
- OCR engine for all the languages☆926Updated last week
- Line based ATR Engine based on OCRopy☆1,177Updated 7 months ago
- Download Poppler binaries packaged for Windows with dependencies☆1,054Updated 3 weeks ago
- Links to awesome OCR projects☆3,069Updated last year
- A Python wrapper for the tesseract-ocr API☆2,140Updated last week
- Tesseract Open Source OCR Engine (main repository)☆71,496Updated last week
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.☆1,473Updated 3 months ago
- Demos, examples and utilities using PyMuPDF☆692Updated last year
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.☆523Updated 4 years ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,924Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆404Updated last year
- A Gtk/Qt front-end to tesseract-ocr.☆1,891Updated 3 months ago
- Data used for LSTM model training☆124Updated last year
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆8,727Updated last week
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.☆733Updated 3 weeks ago
- Box editor and trainer for Tesseract OCR☆248Updated 2 weeks ago
- Convert a PDF via OCR to a TXT file in UTF-8 encoding☆154Updated 2 years ago
- Library used to deskew a scanned document☆495Updated 3 weeks ago
- Extract tables from scanned image PDFs using Optical Character Recognition.☆276Updated 5 years ago
- ☆996Updated last year
- Python bindings to PDFium, reasonably cross-platform.☆696Updated this week
- mupdf mirror☆2,484Updated last week