A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
☆929Jun 13, 2018Updated 8 years ago
Alternatives and similar repositories for pyocr
Users that are interested in pyocr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python wrapper for the tesseract-ocr API☆2,168Mar 16, 2026Updated 2 months ago
- Python-based tools for document analysis and OCR☆3,468May 22, 2021Updated 5 years ago
- A Python wrapper for Google Tesseract☆6,360May 25, 2026Updated 3 weeks ago
- Tesseract Open Source OCR Engine (main repository)☆74,618Jun 4, 2026Updated last week
- make a better chinese character recognition OCR than tesseract☆1,511Nov 12, 2017Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A small C++ implementation of LSTM networks, focused on OCR.☆832Oct 24, 2019Updated 6 years ago
- ARCHIVED: A Python API for Tesseract☆20Jul 25, 2017Updated 8 years ago
- Python bindings to the Tesseract API☆66Jul 5, 2016Updated 9 years ago
- A simple python OCR engine using opencv☆532Feb 1, 2024Updated 2 years ago
- Python PDF Parser (Not actively maintained). Check out pdfminer.six.☆5,290Dec 7, 2022Updated 3 years ago
- Personal document manager (Linux/Windows) -- Moved to Gnome's Gitlab☆2,436Mar 26, 2026Updated 2 months ago
- Simplified interface for TensorFlow (mimicking Scikit Learn) for Deep Learning☆3,171Aug 30, 2021Updated 4 years ago
- NumPy and Pandas interface to Big Data☆3,192Sep 29, 2023Updated 2 years ago
- Fuzzy String Matching in Python☆9,258Feb 24, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.☆1,279Dec 1, 2020Updated 5 years ago
- extract text from any document. no muss. no fuss.☆4,614May 7, 2026Updated last month
- Scan, index, and archive all of your paper documents☆7,920Apr 6, 2021Updated 5 years ago
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.☆2,257Jun 24, 2022Updated 3 years ago
- Next generation OCR engine based on LSTMs.☆51Apr 8, 2018Updated 8 years ago
- Assignment of Image Analysis and Understanding☆48Apr 10, 2017Updated 9 years ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆33,652May 19, 2026Updated 3 weeks ago
- Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.☆9,535Updated this week
- Accelerate your web app development | Build fast. Run fast.☆18,629May 31, 2026Updated 2 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.☆28,201Apr 1, 2026Updated 2 months ago
- OCR with caffe deep learning framework -> Migrated to tensorflow☆215Dec 22, 2016Updated 9 years ago
- Deep learning library featuring a higher-level API for TensorFlow.☆9,579May 6, 2024Updated 2 years ago
- An expandable and scalable OCR pipeline☆90Nov 14, 2017Updated 8 years ago
- 🏹 Better dates & times for Python☆9,045Jun 8, 2026Updated last week
- Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays…☆9,995Jan 15, 2024Updated 2 years ago
- Coroutine-based concurrency library for Python☆6,441Jun 3, 2026Updated last week
- Presentations, tutorials and data for the OCR workshop at LMU☆16Jun 2, 2017Updated 9 years ago
- Webkit based scriptable web browser for python.☆2,756Feb 24, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Deep Learning for humans☆64,079Updated this week
- Topic Modelling for Humans☆16,432Nov 1, 2025Updated 7 months ago
- Caffe: a fast open framework for deep learning.☆34,577Jul 31, 2024Updated last year
- A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files☆10,036Updated this week
- Django Happy Urls☆44Apr 4, 2014Updated 12 years ago
- ☆10Mar 16, 2023Updated 3 years ago
- Python datetimes made easy☆6,670May 14, 2026Updated last month