openpaperwork / pyocr
A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
☆931Updated 6 years ago
Alternatives and similar repositories for pyocr:
Users that are interested in pyocr are comparing it to the libraries listed below
- A Python wrapper for the tesseract-ocr API☆2,042Updated last month
- Python-based tools for document analysis and OCR☆3,432Updated 3 years ago
- A small C++ implementation of LSTM networks, focused on OCR.☆823Updated 5 years ago
- 🖺 OCR using tensorflow with attention☆647Updated 5 years ago
- A Python wrapper for Google Tesseract☆5,971Updated 2 weeks ago
- A simple python OCR engine using opencv☆527Updated 11 months ago
- This is a tutorial on getting OCR running on a simple web server, using python, flask, tesseract-ocr, and leptonica☆258Updated 4 years ago
- Using neural networks to build an automatic number plate recognition system☆1,848Updated 5 years ago
- Mapping photos of Old New York☆287Updated last month
- Various documents related to Tesseract OCR☆263Updated 3 years ago
- python wrapper for the ZXing barcode library☆275Updated 3 years ago
- The simplest way to extract text from PDFs in Python☆428Updated 2 years ago
- Source training data for Tesseract for lots of languages☆845Updated 10 months ago
- Python script to do PDF OCR conversion using Tesseract☆373Updated last year
- OCR with caffe deep learning framework -> Migrated to tensorflow☆215Updated 8 years ago
- This is a reading list for deep learning for OCR☆344Updated 7 years ago
- A very simple content-based recommendation engine. Great for learning, but also ready for real-world use.☆534Updated 4 years ago
- A pure-python HTML screen-scraping library☆1,869Updated 2 years ago
- Small library containing various image processing algorithms (+ Python 3 bindings) that has almost no dependencies -- Moved to Gnome's Gi…☆62Updated 6 years ago
- ☆223Updated 8 years ago
- Magic decorator syntax for asynchronous code in Python☆1,460Updated 4 years ago
- Breaking captchas using torch☆181Updated 9 years ago
- Python wrapper for the tesseract OCR engine. The module is based on OpenCV☆178Updated 7 years ago
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.☆2,230Updated 2 years ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆1,267Updated 3 years ago
- A captcha library that generates audio and image CAPTCHAs.☆1,028Updated 5 months ago
- Visual Attention based OCR☆1,115Updated 6 years ago
- Scalable Bloom Filter implemented in Python☆1,619Updated 3 years ago
- Python PDF Parser (Not actively maintained). Check out pdfminer.six.☆5,275Updated 2 years ago
- Webkit based scriptable web browser for python.☆2,760Updated 10 months ago