openpaperwork / pyocr
A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
☆928Updated 6 years ago
Alternatives and similar repositories for pyocr:
Users that are interested in pyocr are comparing it to the libraries listed below
- A Python wrapper for the tesseract-ocr API☆2,084Updated 2 months ago
- Python-based tools for document analysis and OCR☆3,448Updated 3 years ago
- A small C++ implementation of LSTM networks, focused on OCR.☆825Updated 5 years ago
- An OpenCV based document scanner☆808Updated 8 years ago
- A simple python OCR engine using opencv☆531Updated last year
- Mapping photos of Old New York☆288Updated 4 months ago
- Python script to do PDF OCR conversion using Tesseract☆374Updated last year
- Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.☆1,276Updated 4 years ago
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.☆2,237Updated 2 years ago
- Various documents related to Tesseract OCR☆265Updated 3 years ago
- python wrapper for the ZXing barcode library☆274Updated 3 years ago
- Using neural networks to build an automatic number plate recognition system☆1,849Updated 5 years ago
- Detect text with stroke width transform.☆333Updated 9 years ago
- This is a tutorial on getting OCR running on a simple web server, using python, flask, tesseract-ocr, and leptonica☆259Updated 4 years ago
- Source training data for Tesseract for lots of languages☆853Updated 2 weeks ago
- Run your own OCR-as-a-Service using Tesseract and Docker☆1,362Updated last year
- Breaking captchas using torch☆181Updated 9 years ago
- [not actively maintained] A lightweight Python library that uses Webkit to enable easy scraping of dynamic, Javascript-heavy web pages☆534Updated 7 years ago
- The simplest way to extract text from PDFs in Python☆427Updated 2 years ago
- make a better chinese character recognition OCR than tesseract☆1,515Updated 7 years ago
- ☆223Updated 8 years ago
- A Python wrapper for Google Tesseract☆6,081Updated 2 weeks ago
- OCR with caffe deep learning framework -> Migrated to tensorflow☆215Updated 8 years ago
- A Python to Vega translator☆2,032Updated 8 years ago
- Small library containing various image processing algorithms (+ Python 3 bindings) that has almost no dependencies -- Moved to Gnome's Gi…☆62Updated 6 years ago
- This is a reading list for deep learning for OCR☆344Updated 7 years ago
- Python PDF Parser (Not actively maintained). Check out pdfminer.six.☆5,286Updated 2 years ago
- Robust Text Detection implementation based on http://www.stanford.edu/~hchen2/papers/ICIP2011_RobustTextDetection.pdf☆158Updated 7 years ago
- DEPRECATED: Replaced by https://github.com/autopilot-rs/autopy☆842Updated 6 years ago
- Neural network OCR.☆1,129Updated 8 years ago