A Python wrapper for the tesseract-ocr API
☆2,157Jan 13, 2026Updated last month
Alternatives and similar repositories for tesserocr
Users that are interested in tesserocr are comparing it to the libraries listed below
Sorting:
- A Python wrapper for Google Tesseract☆6,318Jan 19, 2026Updated last month
- A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab☆931Jun 13, 2018Updated 7 years ago
- Tesseract Open Source OCR Engine (main repository)☆72,688Updated this week
- Python-based tools for document analysis and OCR☆3,472May 22, 2021Updated 4 years ago
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆29,030Dec 5, 2025Updated 3 months ago
- Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.☆1,279Dec 1, 2020Updated 5 years ago
- Python bindings to the Tesseract API☆66Jul 5, 2016Updated 9 years ago
- Visual profiler for Python☆3,982Jul 15, 2022Updated 3 years ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆33,283Updated this week
- A Python stream processing engine modeled after Yahoo! Pipes☆1,601Dec 28, 2021Updated 4 years ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,942Jul 23, 2024Updated last year
- Ultra fast asyncio event loop.☆11,674Jan 30, 2026Updated last month
- Asynchronous HTTP client/server framework for asyncio and Python☆16,367Feb 27, 2026Updated last week
- 🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library.…☆6,854Jan 28, 2026Updated last month
- A Fast, Extensible Progress Bar for Python and CLI☆30,985Feb 14, 2026Updated 3 weeks ago
- Trained models with fast variant of the "best" LSTM models + legacy models☆7,411Mar 9, 2024Updated last year
- ☆1,569Nov 3, 2021Updated 4 years ago
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.☆2,255Jun 24, 2022Updated 3 years ago
- Links to awesome OCR projects☆3,088Jul 6, 2024Updated last year
- Python datetimes made easy☆6,620Feb 17, 2026Updated 2 weeks ago
- Accelerate your web app development | Build fast. Run fast.☆18,640Jan 7, 2026Updated last month
- 🎇 Quickly search over billions of images☆2,983Dec 6, 2022Updated 3 years ago
- A series of convenience functions to make basic image processing operations such as translation, rotation, resizing, skeletonization, and…☆4,592Jun 24, 2024Updated last year
- DFKI Layout Detection for OCR-D☆47May 1, 2025Updated 10 months ago
- Python SIP wrapper for libtesseract (Apache license)☆12Feb 20, 2017Updated 9 years ago
- A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files☆9,847Updated this week
- ARCHIVED: A Python API for Tesseract☆20Jul 25, 2017Updated 8 years ago
- Python PDF Parser (Not actively maintained). Check out pdfminer.six.☆5,302Dec 7, 2022Updated 3 years ago
- Deep Learning for humans☆63,869Updated this week
- An expandable and scalable OCR pipeline☆90Nov 14, 2017Updated 8 years ago
- A formatter for Python files☆13,988Feb 20, 2026Updated 2 weeks ago
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.☆28,140Updated this week
- Python Imaging Library (Fork)☆13,411Updated this week
- Declarative visualization library for Python☆10,276Feb 27, 2026Updated last week
- Datetimes for Humans™☆3,421Jul 19, 2024Updated last year
- text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network☆3,436Oct 3, 2023Updated 2 years ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆5,895Feb 9, 2026Updated 3 weeks ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Apr 30, 2025Updated 10 months ago
- Python Development Workflow for Humans.☆25,101Feb 16, 2026Updated 2 weeks ago