sirfz / tesserocr
A Python wrapper for the tesseract-ocr API
☆2,057Updated last week
Alternatives and similar repositories for tesserocr:
Users that are interested in tesserocr are comparing it to the libraries listed below
- A Python wrapper for Google Tesseract☆6,009Updated this week
- A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab☆929Updated 6 years ago
- Python-based tools for document analysis and OCR☆3,436Updated 3 years ago
- extract text from any document. no muss. no fuss.☆3,972Updated 2 months ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,710Updated 6 months ago
- Trained models with fast variant of the "best" LSTM models + legacy models☆6,681Updated 11 months ago
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆1,860Updated last month
- Links to awesome OCR projects☆2,902Updated 7 months ago
- Best (most accurate) trained LSTM models.☆1,295Updated 11 months ago
- A simple python OCR engine using opencv☆527Updated last year
- Line based ATR Engine based on OCRopy☆1,118Updated 3 months ago
- Train Tesseract LSTM with make☆655Updated 8 months ago
- A curated list of promising OCR resources☆1,670Updated 2 years ago
- Visual Attention based OCR☆1,116Updated 6 years ago
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.☆2,233Updated 2 years ago
- Tesseract Open Source OCR Engine (main repository)☆3,320Updated 2 months ago
- Read one-dimensional barcodes and QR codes from Python 2 and 3.☆749Updated last year
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.☆1,546Updated 10 months ago
- pdfrw is a pure Python library that reads and writes PDFs☆1,884Updated 9 months ago
- text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network☆3,437Updated last year
- 🪼 a python library for doing approximate and phonetic matching of strings.☆2,097Updated last month
- This is a tutorial on getting OCR running on a simple web server, using python, flask, tesseract-ocr, and leptonica☆258Updated 4 years ago
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.☆1,433Updated 6 months ago
- OCR engine for all the languages☆788Updated this week
- A jquery-like library for python☆2,319Updated 5 months ago
- Source training data for Tesseract for lots of languages☆845Updated 11 months ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.☆3,442Updated 5 months ago
- Text page dewarping using a "cubic sheet" model☆1,455Updated last year
- Official implementation of Character Region Awareness for Text Detection (CRAFT)☆3,187Updated 7 months ago
- A synthetic data generator for text recognition☆3,403Updated 7 months ago