tesseract-ocr/docs

Various documents related to Tesseract OCR

☆269

Alternatives and similar repositories for docs

Users who are interested in docs are comparing it to the repositories listed below.

tesseract-ocr / langdata
Source training data for Tesseract for lots of languages
☆870 · Apr 1, 2025 · Updated last year
guzhenping / the-Papers-and-Data-of-Tesseract-OCR-
I read the classic papers written by Ray Smith. During reading, I made some notes in Chinese. From now, I have known lots of information…
☆31 · Jan 19, 2018 · Updated 8 years ago
tesseract-ocr / tesseract
Tesseract Open Source OCR Engine (main repository)
☆75,547 · Updated this week
ocropus-archive / DUP-ocropy
Python-based tools for document analysis and OCR
☆3,467 · May 22, 2021 · Updated 5 years ago
tmbarchive / docker-ocropus
A small Docker build for the OCRopus OCR system.
☆19 · Dec 16, 2017 · Updated 8 years ago
tmbdev / clstm
A small C++ implementation of LSTM networks, focused on OCR.
☆833 · Oct 24, 2019 · Updated 6 years ago
deepc94 / photo-id-ocr
OpenCV code to extract face and name from government issued ID cards
☆13 · Dec 27, 2015 · Updated 10 years ago
szad670401 / OCR_CharGen
A tool that can generate character samples for OCR training.
☆65 · Oct 22, 2017 · Updated 8 years ago
alexbyrnes / Datapiece
Investigative tool for extracting relevant areas from many documents
☆14 · Nov 17, 2015 · Updated 10 years ago
ocropus / hocr-tools
Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
☆416 · Aug 10, 2024 · Updated last year
mdbecker / pydata_2013
PyData Boston 2013 talks: "Intro to scikit-learn" & "Realtime Predictive Analytics: Using scikit-learn and RabbitMQ"
☆11 · Jan 5, 2014 · Updated 12 years ago
DanBloomberg / leptonica
Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …
☆2,063 · Jul 12, 2026 · Updated 2 weeks ago
LondonTrash / LondonTrash
☆15 · Jun 19, 2014 · Updated 12 years ago
Shreeshrii / tess5train-fonts
Files and Scripts to run Tesseract 5 LSTM Training using fonts
☆78 · Feb 6, 2022 · Updated 4 years ago
kba / awesome-ocr
Links to awesome OCR projects
☆3,112 · Jul 6, 2024 · Updated 2 years ago
sturkmen72 / dlib_pedestrian_detection
☆13 · Oct 1, 2017 · Updated 8 years ago
jeremybmerrill / wayback2csv
transform a datapoint from a website into a CSV time-series dataset using the wayback machine
☆12 · May 24, 2023 · Updated 3 years ago
NVlabs / ocropus3
Repository collecting all the submodules for the new PyTorch-based OCR System.
☆141 · Feb 22, 2021 · Updated 5 years ago
tesseract-ocr / tessdata_best
Best (most accurate) trained LSTM models.
☆1,568 · Mar 9, 2024 · Updated 2 years ago
OCR4all / LAREX
A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.
☆198 · Updated this week
jmuyskens / nicar18-data-blitz-goes-16
☆17 · Mar 8, 2018 · Updated 8 years ago
dannguyen / datajournalism-primer
a general list of resources and articles for people interested in getting into data journalism
☆16 · Apr 12, 2023 · Updated 3 years ago
Early-Modern-OCR / TesseractTraining
Training files produced for and by the Tesseract OCR engine for work on the Early Modern OCR Project (eMOP)
☆37 · Sep 24, 2015 · Updated 10 years ago
opentechinstitute / 990-scraper
Grab nonprofit tax information from the ProPublica API and put it in a Google spreadsheet!
☆14 · Jun 2, 2017 · Updated 9 years ago
robbarry / nicar19-internetwar
☆19 · Mar 20, 2019 · Updated 7 years ago
guidefreitas / mser_sift_image_search
Project using MSER and SIFT descriptors to find similar images.
☆11 · Aug 21, 2014 · Updated 11 years ago
AliceSum / Deep_learning_Coloring-Anime-image-and-satellite-image-house-damge-level-colorized
☆15 · Oct 4, 2022 · Updated 3 years ago
tmbdev-talks / das2018-tutorial
A tutorial on the PyTorch-based ocropus components.
☆73 · Apr 18, 2020 · Updated 6 years ago
vip30 / WebPush-NetCore
WebPush .NET Core version
☆12 · Mar 5, 2019 · Updated 7 years ago
cppan / tesseract_example
Very basic Tesseract-OCR example with CPPAN. Cppan support is discontinued. Please use sw (cppan v2) instead. Updated example is here: ht…
☆31 · Jul 9, 2018 · Updated 8 years ago
hs105 / Deep-Learning-for-OCR
This is a reading list for deep learning for OCR
☆343 · Nov 4, 2017 · Updated 8 years ago
OctoinCoin / octoin
OctoinCoin
☆43 · Feb 20, 2018 · Updated 8 years ago
xiaomaxiao / keras_ocr
OCR localization and recognition implemented with Keras
☆529 · Apr 21, 2019 · Updated 7 years ago
overview / docs2csv
Scan a folder of document files of all types and extract the text into a CSV suitable for Overview
☆26 · Mar 23, 2016 · Updated 10 years ago
ZJULearning / DREN
DREN: Deep Rotation Equivariant Network
☆16 · Mar 24, 2019 · Updated 7 years ago
microsoft / InsiderDevTour18-Labs
Source for additional lab material
☆17 · Jul 16, 2023 · Updated 3 years ago
Hamza5 / multilevel-diacritizer
Extensible DL-based automatic Arabic diacritization tool allowing the restoration of different types of diacritics.
☆24 · Jul 25, 2023 · Updated 3 years ago
definite-app / smallpond
A lightweight data processing framework built on DuckDB and 3FS.
☆22 · Mar 2, 2025 · Updated last year
chuharev / grnti-grabber
Tools for handling GRNTI list
☆10 · Sep 2, 2023 · Updated 2 years ago