A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
☆931 · Jun 13, 2018 · Updated 7 years ago
Alternatives and similar repositories for pyocr
Users interested in pyocr compare it to the libraries listed below.
- A Python wrapper for the tesseract-ocr API ☆2,155 · Jan 13, 2026 · Updated last month
- Python-based tools for document analysis and OCR ☆3,472 · May 22, 2021 · Updated 4 years ago
- A Python wrapper for Google Tesseract ☆6,318 · Jan 19, 2026 · Updated last month
- Tesseract Open Source OCR Engine (main repository) ☆72,562 · Feb 21, 2026 · Updated last week
- make a better chinese character recognition OCR than tesseract ☆1,514 · Nov 12, 2017 · Updated 8 years ago
- A small C++ implementation of LSTM networks, focused on OCR. ☆830 · Oct 24, 2019 · Updated 6 years ago
- Python bindings to the Tesseract API ☆66 · Jul 5, 2016 · Updated 9 years ago
- Python PDF Parser (Not actively maintained). Check out pdfminer.six. ☆5,302 · Dec 7, 2022 · Updated 3 years ago
- A simple python OCR engine using opencv ☆531 · Feb 1, 2024 · Updated 2 years ago
- Personal document manager (Linux/Windows) -- Moved to Gnome's Gitlab ☆2,435 · Jun 13, 2018 · Updated 7 years ago
- Simplified interface for TensorFlow (mimicking Scikit Learn) for Deep Learning ☆3,170 · Aug 30, 2021 · Updated 4 years ago
- Fuzzy String Matching in Python ☆9,270 · Feb 24, 2023 · Updated 3 years ago
- NumPy and Pandas interface to Big Data ☆3,197 · Sep 29, 2023 · Updated 2 years ago
- extract text from any document. no muss. no fuss. ☆4,458 · Feb 4, 2026 · Updated 3 weeks ago
- Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip. ☆1,279 · Dec 1, 2020 · Updated 5 years ago
- Presentations, tutorials and data for the OCR workshop at LMU ☆16 · Jun 2, 2017 · Updated 8 years ago
- Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more. ☆9,517 · Updated this week
- Accelerate your web app development | Build fast. Run fast. ☆18,640 · Jan 7, 2026 · Updated last month
- 💫 Industrial-strength Natural Language Processing (NLP) in Python ☆33,254 · Nov 27, 2025 · Updated 3 months ago
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object. ☆28,130 · Feb 1, 2026 · Updated last month
- Deep learning library featuring a higher-level API for TensorFlow. ☆9,605 · May 6, 2024 · Updated last year
- Scan, index, and archive all of your paper documents ☆7,925 · Apr 6, 2021 · Updated 4 years ago
- Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays… ☆9,985 · Jan 15, 2024 · Updated 2 years ago
- ARCHIVED: A Python API for Tesseract ☆20 · Jul 25, 2017 · Updated 8 years ago
- Lightweight library to build and train neural networks in Theano ☆3,866 · Mar 26, 2022 · Updated 3 years ago
- An expandable and scalable OCR pipeline ☆89 · Nov 14, 2017 · Updated 8 years ago
- Python datetimes made easy ☆6,620 · Feb 17, 2026 · Updated 2 weeks ago
- Topic Modelling for Humans ☆16,361 · Nov 1, 2025 · Updated 4 months ago
- A very naive classifier to figure out if a sentence contains dirty words ☆33 · Jul 7, 2015 · Updated 10 years ago
- Deep Learning for humans ☆63,866 · Updated this week
- 🏹 Better dates & times for Python ☆9,029 · Feb 19, 2026 · Updated last week
- Ultra fast asyncio event loop. ☆11,661 · Jan 30, 2026 · Updated last month
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents. ☆2,254 · Jun 24, 2022 · Updated 3 years ago
- Crab is a flexible, fast recommender engine for Python that integrates classic information filtering recommendation algorithms in the world… ☆1,175 · Dec 30, 2020 · Updated 5 years ago
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli… ☆20,829 · Oct 25, 2023 · Updated 2 years ago
- Datetimes for Humans™ ☆3,420 · Jul 19, 2024 · Updated last year
- Caffe: a fast open framework for deep learning. ☆34,778 · Jul 31, 2024 · Updated last year
- A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files ☆9,839 · Updated this week
- Coroutine-based concurrency library for Python ☆6,439 · Updated this week