sirfz/tesserocr

A Python wrapper for the tesseract-ocr API

☆2,166

Alternatives and similar repositories for tesserocr

Users who are interested in tesserocr are comparing it to the libraries listed below.

madmaze / pytesseract
A Python wrapper for Google Tesseract
☆6,373 · Jul 13, 2026 · Updated last week
openpaperwork / pyocr
A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
☆929 · Jun 13, 2018 · Updated 8 years ago
tesseract-ocr / tesseract
Tesseract Open Source OCR Engine (main repository)
☆75,547 · Updated this week
ocropus-archive / DUP-ocropy
Python-based tools for document analysis and OCR
☆3,467 · May 22, 2021 · Updated 5 years ago
JaidedAI / EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …
☆29,819 · Dec 5, 2025 · Updated 7 months ago
jlsutherland / doc2text
Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.
☆1,279 · Dec 1, 2020 · Updated 5 years ago
gregjurman / tesserwrap
Python bindings to the Tesseract API
☆66 · Jul 5, 2016 · Updated 10 years ago
nvdv / vprof
Visual profiler for Python
☆3,981 · Jul 15, 2022 · Updated 4 years ago
Belval / pdf2image
A python module that wraps the pdftoppm utility to convert PDF to PIL Image object
☆1,975 · Jul 23, 2024 · Updated 2 years ago
explosion / spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
☆33,773 · May 19, 2026 · Updated 2 months ago
leha-bot / PRLib
Pre-Recognize Library - library with algorithms for improving OCR quality.
☆112 · May 2, 2023 · Updated 3 years ago
OCR-D / ocrd_anybaseocr
DFKI Layout Detection for OCR-D
☆47 · May 1, 2025 · Updated last year
tesseract-ocr / tessdata
Trained models with fast variant of the "best" LSTM models + legacy models
☆7,610 · Mar 9, 2024 · Updated 2 years ago
nerevu / riko
A Python stream processing engine modeled after Yahoo! Pipes
☆1,601 · Updated this week
kba / awesome-ocr
Links to awesome OCR projects
☆3,112 · Jul 6, 2024 · Updated 2 years ago
eragonruan / text-detection-ctpn
text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network
☆3,430 · Oct 3, 2023 · Updated 2 years ago
UB-Mannheim / ocr-fileformat
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
☆204 · May 21, 2025 · Updated last year
virtuald / python-tesseract-sip
Python SIP wrapper for libtesseract (Apache license)
☆12 · Feb 20, 2017 · Updated 9 years ago
gitanat / simple-ocr-opencv
A simple python OCR engine using opencv
☆533 · Feb 1, 2024 · Updated 2 years ago
MagicStack / uvloop
Ultra fast asyncio event loop.
☆11,866 · Jul 14, 2026 · Updated last week
tesseract-ocr / docs
Various documents related to Tesseract OCR
☆269 · Sep 12, 2021 · Updated 4 years ago
pannous / tensorflow-ocr
🖺 OCR using tensorflow with attention
☆644 · Sep 5, 2019 · Updated 6 years ago
OCR-D / ocrd_tesserocr
Run tesseract with the tesserocr bindings with @OCR-D's interfaces
☆39 · Jun 5, 2026 · Updated last month
WZBSocialScienceCenter / pdftabextract
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
☆2,255 · Jun 24, 2022 · Updated 4 years ago
mindee / doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning. Ongo…
☆6,190 · Updated this week
aio-libs / aiohttp
Asynchronous HTTP client/server framework for asyncio and Python
☆16,504 · Updated this week
tesseract-ocr / tesstrain
Train Tesseract LSTM with make
☆722 · Apr 18, 2025 · Updated last year
rhsimplex / image-match
🎇 Quickly search over billions of images
☆2,978 · Dec 6, 2022 · Updated 3 years ago
deanmalmgren / textract
extract text from any document. no muss. no fuss.
☆4,675 · Jul 11, 2026 · Updated 2 weeks ago
OpenPhilology / nidaba
An expandable and scalable OCR pipeline
☆90 · Nov 14, 2017 · Updated 8 years ago
mahmoud / boltons
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library.…
☆6,906 · Jul 18, 2026 · Updated last week
DanBloomberg / leptonica
Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …
☆2,063 · Jul 12, 2026 · Updated 2 weeks ago
tqdm / tqdm
A Fast, Extensible Progress Bar for Python and CLI
☆31,249 · Updated this week
Calamari-OCR / calamari
Line based ATR Engine based on OCRopy
☆1,198 · Jun 23, 2026 · Updated last month
PyImageSearch / imutils
A series of convenience functions to make basic image processing operations such as translation, rotation, resizing, skeletonization, and…
☆4,592 · Jun 24, 2024 · Updated 2 years ago
alex-sherman / deco
☆1,567 · Nov 3, 2021 · Updated 4 years ago
altoxml / documentation
Documentation and use cases for ALTO XML
☆42 · Sep 10, 2018 · Updated 7 years ago
UB-Mannheim / GTCheck
Check your modified Ground Truth files with visual support!
☆10 · Jan 31, 2024 · Updated 2 years ago
pdfminer / pdfminer.six
Community maintained fork of pdfminer - we fathom PDF
☆7,008 · Mar 13, 2026 · Updated 4 months ago