A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
☆931 · Jun 13, 2018 · Updated 7 years ago
Alternatives and similar repositories for pyocr
Users that are interested in pyocr are comparing it to the libraries listed below.
- A Python wrapper for the tesseract-ocr API ☆2,160 · Mar 16, 2026 · Updated last week
- Python-based tools for document analysis and OCR ☆3,474 · May 22, 2021 · Updated 4 years ago
- A Python wrapper for Google Tesseract ☆6,322 · Mar 16, 2026 · Updated last week
- Tesseract Open Source OCR Engine (main repository) ☆72,962 · Mar 16, 2026 · Updated last week
- make a better chinese character recognition OCR than tesseract ☆1,514 · Nov 12, 2017 · Updated 8 years ago
- A small C++ implementation of LSTM networks, focused on OCR. ☆830 · Oct 24, 2019 · Updated 6 years ago
- Python bindings to the Tesseract API ☆66 · Jul 5, 2016 · Updated 9 years ago
- A simple python OCR engine using opencv ☆532 · Feb 1, 2024 · Updated 2 years ago
- Python PDF Parser (Not actively maintained). Check out pdfminer.six. ☆5,302 · Dec 7, 2022 · Updated 3 years ago
- Personal document manager (Linux/Windows) -- Moved to Gnome's Gitlab ☆2,435 · Jun 13, 2018 · Updated 7 years ago
- Simplified interface for TensorFlow (mimicking Scikit Learn) for Deep Learning ☆3,168 · Aug 30, 2021 · Updated 4 years ago
- NumPy and Pandas interface to Big Data ☆3,195 · Sep 29, 2023 · Updated 2 years ago
- Fuzzy String Matching in Python ☆9,265 · Feb 24, 2023 · Updated 3 years ago
- extract text from any document. no muss. no fuss. ☆4,483 · Feb 4, 2026 · Updated last month
- Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip. ☆1,279 · Dec 1, 2020 · Updated 5 years ago
- Run your own OCR-as-a-Service using Tesseract and Docker ☆1,370 · Sep 15, 2023 · Updated 2 years ago
- Scan, index, and archive all of your paper documents ☆7,926 · Apr 6, 2021 · Updated 4 years ago
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents. ☆2,258 · Jun 24, 2022 · Updated 3 years ago
- Next generation OCR engine based on LSTMs. ☆51 · Apr 8, 2018 · Updated 7 years ago
- Assignment of Image Analysis and Understanding ☆49 · Apr 10, 2017 · Updated 8 years ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python ☆33,352 · Mar 15, 2026 · Updated last week
- Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more. ☆9,518 · Mar 16, 2026 · Updated last week
- Accelerate your web app development | Build fast. Run fast. ☆18,637 · Jan 7, 2026 · Updated 2 months ago
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object. ☆28,157 · Mar 1, 2026 · Updated 3 weeks ago
- OCR with caffe deep learning framework -> Migrated to tensorflow ☆215 · Dec 22, 2016 · Updated 9 years ago
- Deep learning library featuring a higher-level API for TensorFlow. ☆9,596 · May 6, 2024 · Updated last year
- An expandable and scalable OCR pipeline ☆90 · Nov 14, 2017 · Updated 8 years ago
- 🏹 Better dates & times for Python ☆9,036 · Feb 19, 2026 · Updated last month
- Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays… ☆9,985 · Jan 15, 2024 · Updated 2 years ago
- Coroutine-based concurrency library for Python ☆6,447 · Mar 1, 2026 · Updated 3 weeks ago
- Presentations, tutorials and data for the OCR workshop at LMU ☆16 · Jun 2, 2017 · Updated 8 years ago
- Webkit based scriptable web browser for python. ☆2,761 · Feb 24, 2024 · Updated 2 years ago
- Deep Learning for humans ☆63,955 · Updated this week
- Topic Modelling for Humans ☆16,378 · Nov 1, 2025 · Updated 4 months ago
- A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files ☆9,882 · Mar 17, 2026 · Updated last week
- Caffe: a fast open framework for deep learning. ☆34,774 · Jul 31, 2024 · Updated last year
- ☆10 · Mar 16, 2023 · Updated 3 years ago
- Python datetimes made easy ☆6,632 · Mar 6, 2026 · Updated 2 weeks ago
- Python Development Workflow for Humans. ☆25,101 · Updated this week