sirfz / tesserocrLinks
A Python wrapper for the tesseract-ocr API
☆2,120Updated last month
Alternatives and similar repositories for tesserocr
Users that are interested in tesserocr are comparing it to the libraries listed below
Sorting:
- A Python wrapper for Google Tesseract☆6,229Updated last week
- Python-based tools for document analysis and OCR☆3,465Updated 4 years ago
- A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab☆929Updated 7 years ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,870Updated last year
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.☆1,467Updated last week
- Train Tesseract LSTM with make☆698Updated 5 months ago
- Best (most accurate) trained LSTM models.☆1,425Updated last year
- Line based ATR Engine based on OCRopy☆1,166Updated 4 months ago
- Links to awesome OCR projects☆3,046Updated last year
- Read one-dimensional barcodes and QR codes from Python 2 and 3.☆789Updated last year
- OCR engine for all the languages☆889Updated last week
- A simple python OCR engine using opencv☆531Updated last year
- Text page dewarping using a "cubic sheet" model☆1,485Updated 2 years ago
- Source training data for Tesseract for lots of languages☆859Updated 6 months ago
- extract text from any document. no muss. no fuss.☆4,315Updated 10 months ago
- text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network☆3,443Updated 2 years ago
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.☆2,246Updated 3 years ago
- Tesseract documentation☆2,179Updated 2 weeks ago
- Trained models with fast variant of the "best" LSTM models + legacy models☆7,167Updated last year
- Community maintained fork of pdfminer - we fathom PDF☆6,739Updated 4 months ago
- Various documents related to Tesseract OCR☆266Updated 4 years ago
- A Python Perceptual Image Hashing Module☆3,721Updated 5 months ago
- A series of convenience functions to make basic image processing operations such as translation, rotation, resizing, skeletonization, and…☆4,573Updated last year
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.☆1,623Updated 5 months ago
- A tensorflow implementation of EAST text detector☆3,057Updated 2 years ago
- A synthetic data generator for text recognition☆3,571Updated last year
- TableBank: A Benchmark Dataset for Table Detection and Recognition☆1,070Updated last year
- Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.☆1,277Updated 4 years ago
- Library used to deskew a scanned document☆485Updated this week
- A Python library for reading and writing PDF, powered by QPDF