openpaperwork / pyocr
A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
☆930Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for pyocr
- Python-based tools for document analysis and OCR☆3,422Updated 3 years ago
- A simple python OCR engine using opencv☆525Updated 9 months ago
- 🖺 OCR using tensorflow with attention☆647Updated 5 years ago
- A small C++ implementation of LSTM networks, focused on OCR.☆821Updated 5 years ago
- This is a tutorial on getting OCR running on a simple web server, using python, flask, tesseract-ocr, and leptonica☆257Updated 4 years ago
- Using neural networks to build an automatic number plate recognition system☆1,844Updated 5 years ago
- A Python wrapper for the tesseract-ocr API☆2,016Updated 2 months ago
- Python script to do PDF OCR conversion using Tesseract☆373Updated last year
- An OpenCV based document scanner☆798Updated 8 years ago
- Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.☆1,273Updated 3 years ago
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.☆2,220Updated 2 years ago
- python wrapper for the ZXing barcode library☆274Updated 3 years ago
- Mapping photos of Old New York☆288Updated this week
- Breaking captchas using torch☆181Updated 8 years ago
- Various documents related to Tesseract OCR☆261Updated 3 years ago
- Small library containing various image processing algorithms (+ Python 3 bindings) that has almost no dependencies -- Moved to Gnome's Gi…☆62Updated 6 years ago
- A Python wrapper for Google Tesseract☆5,868Updated 3 weeks ago
- OCR with caffe deep learning framework -> Migrated to tensorflow☆215Updated 7 years ago
- Detect text with stroke width transform.☆331Updated 8 years ago
- extract text from any document. no muss. no fuss.☆3,910Updated this week
- The simplest way to extract text from PDFs in Python☆427Updated 2 years ago
- Cross-platform text-to-speech wrapper☆370Updated 3 years ago
- Text page dewarping using a "cubic sheet" model☆1,442Updated last year
- [not actively maintained] A lightweight Python library that uses Webkit to enable easy scraping of dynamic, Javascript-heavy web pages☆533Updated 7 years ago
- A simple viewer and inspection tool for text boxes in PDF documents☆92Updated 2 years ago
- A curated list of promising OCR resources☆1,667Updated 2 years ago
- This is a reading list for deep learning for OCR☆346Updated 7 years ago
- Run your own OCR-as-a-Service using Tesseract and Docker☆1,342Updated last year
- Line based ATR Engine based on OCRopy☆1,051Updated last week