tesseract-ocr / tessdoc
Tesseract documentation
☆2,098 Updated last month
Alternatives and similar repositories for tessdoc
Users that are interested in tessdoc are comparing it to the libraries listed below
- Best (most accurate) trained LSTM models. ☆1,376 Updated last year
- Tesseract Open Source OCR Engine (main repository) ☆3,623 Updated last week
- Trained models with fast variant of the "best" LSTM models + legacy models ☆7,039 Updated last year
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The … ☆1,936 Updated last week
- Fast integer versions of trained LSTM models ☆551 Updated 11 months ago
- Train Tesseract LSTM with make ☆686 Updated 2 months ago
- Source training data for Tesseract for lots of languages ☆857 Updated 3 months ago
- Download Poppler binaries packaged for Windows with dependencies ☆861 Updated 7 months ago
- A Python wrapper for Google Tesseract ☆6,159 Updated 3 weeks ago
- A Python module that wraps the pdftoppm utility to convert PDF to PIL Image object ☆1,828 Updated 11 months ago
- Demos, examples and utilities using PyMuPDF ☆669 Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆395 Updated 11 months ago
- A Python wrapper for the tesseract-ocr API ☆2,107 Updated last month
- Links to awesome OCR projects ☆3,013 Updated last year
- OCR engine for all the languages ☆849 Updated last week
- Line based ATR Engine based on OCRopy ☆1,152 Updated 2 months ago
- A Gtk/Qt front-end to tesseract-ocr. ☆1,782 Updated this week
- ☆143 Updated 5 years ago
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents. ☆192 Updated last week
- Library used to deskew a scanned document ☆473 Updated 2 weeks ago
- Box editor and trainer for Tesseract OCR ☆241 Updated 3 weeks ago
- Convert a PDF via OCR to a TXT file in UTF-8 encoding ☆153 Updated last year
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip! ☆291 Updated last month
- Fine-tuned traineddata files for Tesseract 4.0.0 for testing ☆166 Updated 6 years ago
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. ☆7,560 Updated this week
- Extract tables from scanned image PDFs using Optical Character Recognition. ☆275 Updated 5 years ago
- Python-tesseract is an optical character recognition (OCR) tool for Python ☆153 Updated 7 years ago
- A curated list of awesome projects to simplify and improve paper and document scanning. ☆445 Updated 2 weeks ago
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and … ☆27,261 Updated 9 months ago
- Simple PDF text extraction ☆942 Updated 5 months ago