A Python wrapper for the tesseract-ocr API
☆2,163Mar 16, 2026Updated last month
Alternatives and similar repositories for tesserocr
Users that are interested in tesserocr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python wrapper for Google Tesseract☆6,334Mar 16, 2026Updated last month
- A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab☆929Jun 13, 2018Updated 7 years ago
- Tesseract Open Source OCR Engine (main repository)☆73,879Apr 27, 2026Updated last week
- Python-based tools for document analysis and OCR☆3,471May 22, 2021Updated 4 years ago
- Python bindings to the Tesseract API☆66Jul 5, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆29,396Dec 5, 2025Updated 5 months ago
- Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.☆1,279Dec 1, 2020Updated 5 years ago
- Visual profiler for Python☆3,980Jul 15, 2022Updated 3 years ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,965Jul 23, 2024Updated last year
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆33,544Mar 28, 2026Updated last month
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆112May 2, 2023Updated 3 years ago
- DFKI Layout Detection for OCR-D☆47May 1, 2025Updated last year
- Trained models with fast variant of the "best" LSTM models + legacy models☆7,514Mar 9, 2024Updated 2 years ago
- A Python stream processing engine modeled after Yahoo! Pipes☆1,601Dec 28, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Links to awesome OCR projects☆3,105Jul 6, 2024Updated last year
- text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network☆3,436Oct 3, 2023Updated 2 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆202May 21, 2025Updated 11 months ago
- Python SIP wrapper for libtesseract (Apache license)☆12Feb 20, 2017Updated 9 years ago
- Ultra fast asyncio event loop.☆11,779Updated this week
- A simple python OCR engine using opencv☆532Feb 1, 2024Updated 2 years ago
- An expandable and scalable OCR pipeline☆90Nov 14, 2017Updated 8 years ago
- Various documents related to Tesseract OCR☆267Sep 12, 2021Updated 4 years ago
- 🖺 OCR using tensorflow with attention☆645Sep 5, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.☆2,256Jun 24, 2022Updated 3 years ago
- Asynchronous HTTP client/server framework for asyncio and Python☆16,428Updated this week
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆6,058Apr 27, 2026Updated last week
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Apr 29, 2026Updated last week
- 🎇 Quickly search over billions of images☆2,970Dec 6, 2022Updated 3 years ago
- 🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library.…☆6,882Mar 6, 2026Updated 2 months ago
- Train Tesseract LSTM with make☆722Apr 18, 2025Updated last year
- A Fast, Extensible Progress Bar for Python and CLI☆31,138Feb 14, 2026Updated 2 months ago
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆2,048Apr 18, 2026Updated 2 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆1,568Nov 3, 2021Updated 4 years ago
- A series of convenience functions to make basic image processing operations such as translation, rotation, resizing, skeletonization, and…☆4,594Jun 24, 2024Updated last year
- Line based ATR Engine based on OCRopy☆1,190May 12, 2025Updated 11 months ago
- A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files☆9,979Updated this week
- Documentation and use cases for ALTO XML☆42Sep 10, 2018Updated 7 years ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Community maintained fork of pdfminer - we fathom PDF☆6,966Mar 13, 2026Updated last month