Tesseract documentation
☆2,303Feb 23, 2026Updated last week
Alternatives and similar repositories for tessdoc
Users that are interested in tessdoc are comparing it to the libraries listed below
Sorting:
- Tesseract Open Source OCR Engine (main repository)☆72,688Updated this week
- Best (most accurate) trained LSTM models.☆1,509Mar 9, 2024Updated last year
- Trained models with fast variant of the "best" LSTM models + legacy models☆7,411Mar 9, 2024Updated last year
- Train Tesseract LSTM with make☆714Apr 18, 2025Updated 10 months ago
- Tesseract Open Source OCR Engine (main repository)☆4,122Feb 20, 2026Updated 2 weeks ago
- A Python wrapper for Google Tesseract☆6,318Jan 19, 2026Updated last month
- Fast integer versions of trained LSTM models☆595Aug 1, 2024Updated last year
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆29,030Dec 5, 2025Updated 3 months ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts☆79Feb 6, 2022Updated 4 years ago
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆71,369Updated this week
- OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched☆32,798Feb 21, 2026Updated last week
- Repository for tesseract testing☆35Jun 9, 2024Updated last year
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆2,028Feb 28, 2026Updated last week
- Documentation and use cases for ALTO XML☆42Sep 10, 2018Updated 7 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆95,527Dec 15, 2025Updated 2 months ago
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆164,248Updated this week
- Dockerized example to train Tesseract v. 4☆63Dec 8, 2022Updated 3 years ago
- Links to awesome OCR projects☆3,088Jul 6, 2024Updated last year
- A Python wrapper for the tesseract-ocr API☆2,157Jan 13, 2026Updated last month
- Converters for various file formats used for representing OCR☆12Apr 30, 2025Updated 10 months ago
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…☆157,462Updated this week
- ALTO XML schema - latest and all former versions☆55Jan 20, 2026Updated last month
- Open Source Computer Vision Library☆86,391Updated this week
- Zotero Plugin for OCR☆752Feb 15, 2026Updated 2 weeks ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆5,895Feb 9, 2026Updated 3 weeks ago
- OCR engine for all the languages☆956Feb 25, 2026Updated last week
- 🦜🔗 The platform for reliable agents.☆127,809Updated this week
- Stable Diffusion web UI☆161,451Updated this week
- 'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc. The final training process for lat.traineddata☆13Jan 13, 2016Updated 10 years ago
- Pure Javascript OCR for more than 100 Languages 📖🎉🖥☆37,897Feb 28, 2026Updated last week
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆9,146Feb 27, 2026Updated last week
- Community maintained fork of pdfminer - we fathom PDF☆6,909Feb 24, 2026Updated last week
- LLM inference in C/C++☆96,322Updated this week
- ImageMagick is a free, open-source software suite for creating, editing, converting, and displaying images. It supports 200+ formats and …☆15,844Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆125,513Updated this week
- Node Version Manager - POSIX-compliant bash script to manage multiple active node.js versions☆91,916Feb 20, 2026Updated 2 weeks ago
- A feature-rich command-line audio/video downloader☆149,202Feb 26, 2026Updated last week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆77,171May 27, 2025Updated 9 months ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,942Jul 23, 2024Updated last year