openpaperwork / pyocr
A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
☆927 Updated 7 years ago
Alternatives and similar repositories for pyocr
Users who are interested in pyocr are comparing it to the libraries listed below.
- A small C++ implementation of LSTM networks, focused on OCR. ☆827 Updated 5 years ago
- A Python wrapper for the tesseract-ocr API ☆2,102 Updated 3 weeks ago
- Python-based tools for document analysis and OCR ☆3,450 Updated 4 years ago
- A simple python OCR engine using opencv ☆532 Updated last year
- 🖺 OCR using tensorflow with attention ☆647 Updated 5 years ago
- A Python wrapper for Google Tesseract ☆6,138 Updated 3 weeks ago
- Source training data for Tesseract for lots of languages ☆857 Updated 2 months ago
- Python script to do PDF OCR conversion using Tesseract ☆375 Updated 2 years ago
- Various documents related to Tesseract OCR ☆266 Updated 3 years ago
- This is a reading list for deep learning for OCR ☆344 Updated 7 years ago
- An OpenCV based document scanner ☆809 Updated 8 years ago
- Python 3 port of pdfminer ☆186 Updated 6 years ago
- A pure-python HTML screen-scraping library ☆1,877 Updated 3 years ago
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents. ☆2,243 Updated 2 years ago
- python wrapper for the ZXing barcode library ☆274 Updated 3 years ago
- Mapping photos of Old New York ☆290 Updated 6 months ago
- Webkit based scriptable web browser for python. ☆2,765 Updated last year
- A more complete example of programming with PDFMiner, which continues where the default documentation stops ☆214 Updated 5 years ago
- Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip. ☆1,275 Updated 4 years ago
- ☆223 Updated 9 years ago
- Using neural networks to build an automatic number plate recognition system ☆1,854 Updated 5 years ago
- A high-level distributed crawling framework. ☆1,508 Updated 2 years ago
- A simple program to extract the text from an image before performing OCR ☆222 Updated 5 years ago
- Python wrapper for the tesseract OCR engine. The module is based on OpenCV ☆177 Updated 7 years ago
- Detect text with stroke width transform. ☆333 Updated 9 years ago
- Run your own OCR-as-a-Service using Tesseract and Docker ☆1,366 Updated last year
- OCR with caffe deep learning framework -> Migrated to tensorflow ☆215 Updated 8 years ago
- Extends Selenium WebDriver classes to include the request function from the Requests library, while doing all the needed cookie and reque… ☆496 Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆1,274 Updated 3 years ago
- The ctypes-based simple ImageMagick binding for Python ☆1,454 Updated 2 months ago