tesseract-ocr/tessdoc

tesseract-ocr / tessdoc

Tesseract documentation

☆2,405

Alternatives and similar repositories for tessdoc

Users interested in tessdoc are comparing it to the repositories listed below.

tesseract-ocr / tesseract
Tesseract Open Source OCR Engine (main repository)
☆75,596 · Updated this week
tesseract-ocr / tessdata_best
Best (most accurate) trained LSTM models.
☆1,567 · Mar 9, 2024 · Updated 2 years ago
tesseract-ocr / tessdata
Trained models with fast variant of the "best" LSTM models + legacy models
☆7,613 · Mar 9, 2024 · Updated 2 years ago
UB-Mannheim / tesseract
Tesseract Open Source OCR Engine (main repository)
☆4,539 · Updated this week
tesseract-ocr / tesstrain
Train Tesseract LSTM with make
☆722 · Apr 18, 2025 · Updated last year
otiai10 / gosseract
Go package for OCR (Optical Character Recognition), by using Tesseract C++ library
☆3,125 · Jan 16, 2026 · Updated 6 months ago
madmaze / pytesseract
A Python wrapper for Google Tesseract
☆6,373 · Jul 13, 2026 · Updated 2 weeks ago
tesseract-ocr / tessdata_fast
Fast integer versions of trained LSTM models
☆607 · Aug 1, 2024 · Updated last year
tesseract-ocr / langdata_lstm
Data used for LSTM model training
☆127 · Mar 9, 2024 · Updated 2 years ago
Shreeshrii / tess5train-fonts
Files and Scripts to run Tesseract 5 LSTM Training using fonts
☆78 · Feb 6, 2022 · Updated 4 years ago
JaidedAI / EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …
☆29,826 · Dec 5, 2025 · Updated 7 months ago
altoxml / documentation
Documentation and use cases for ALTO XML
☆42 · Sep 10, 2018 · Updated 7 years ago
leha-bot / PRLib
Pre-Recognize Library - library with algorithms for improving OCR quality.
☆112 · May 2, 2023 · Updated 3 years ago
OCR-D / format-converters
Converters for various file formats used for representing OCR
☆12 · Apr 30, 2025 · Updated last year
PaddlePaddle / PaddleOCR
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…
☆86,350 · Jul 22, 2026 · Updated last week
tesseract-ocr / langdata
Source training data for Tesseract for lots of languages
☆870 · Apr 1, 2025 · Updated last year
ocrmypdf / OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
☆34,298 · Updated this week
DanBloomberg / leptonica
Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …
☆2,064 · Jul 12, 2026 · Updated 2 weeks ago
altoxml / schema
ALTO XML schema - latest and all former versions
☆55 · Jul 8, 2026 · Updated 3 weeks ago
otiai10 / ocrserver
A simple OCR API server, seriously easy to be deployed by Docker, on Heroku as well
☆767 · Aug 5, 2021 · Updated 4 years ago
livezingy / tesstrainsh-win
Train Tesseract LSTM with tesstrain.sh on Windows
☆26 · Dec 24, 2023 · Updated 2 years ago
ryanfb / latinocr-lat
'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc. The final training process for lat.traineddata
☆13 · Jan 13, 2016 · Updated 10 years ago
nguyenq / jTessBoxEditor
Box editor and trainer for Tesseract OCR
☆247 · Updated this week
sirfz / tesserocr
A Python wrapper for the tesseract-ocr API
☆2,166 · Mar 16, 2026 · Updated 4 months ago
mittagessen / kraken
OCR engine for all the languages
☆1,040 · Updated this week
tesseract-ocr / docs
Various documents related to Tesseract OCR
☆269 · Sep 12, 2021 · Updated 4 years ago
openai / whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆105,839 · Apr 15, 2026 · Updated 3 months ago
kba / awesome-ocr
Links to awesome OCR projects
☆3,112 · Jul 6, 2024 · Updated 2 years ago
mindee / doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning. Ongo…
☆6,194 · Updated this week
ollama / ollama
Get up and running with Kimi-K2.6, GLM-5.2, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
☆177,051 · Updated this week
UB-Mannheim / zotero-ocr
Zotero Plugin for OCR
☆807 · Jun 4, 2026 · Updated last month
jbaiter / hocrviewer-mirador
View HOCR files with Mirador
☆30 · Sep 27, 2017 · Updated 8 years ago
opencv / opencv
Open Source Computer Vision Library
☆90,181 · Updated this week
naptha / tesseract.js
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
☆38,572 · May 17, 2026 · Updated 2 months ago
pymupdf / PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
☆10,334 · Updated this week
UB-Mannheim / ocr-fileformat
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
☆204 · May 21, 2025 · Updated last year
AUTOMATIC1111 / stable-diffusion-webui
Stable Diffusion web UI
☆164,294 · Mar 2, 2026 · Updated 4 months ago
filak / hOCR-to-ALTO
Convert between Tesseract hOCR and ALTO XML using XSL stylesheets
☆60 · Mar 20, 2026 · Updated 4 months ago
charlesw / tesseract
A .Net wrapper for tesseract-ocr
☆2,457 · Apr 29, 2025 · Updated last year