tesseract-ocr / tessdoc
Tesseract documentation
☆2,148 Updated 2 weeks ago
Alternatives and similar repositories for tessdoc
Users that are interested in tessdoc are comparing it to the libraries listed below
- Best (most accurate) trained LSTM models. ☆1,406 Updated last year
- Fast integer versions of trained LSTM models ☆563 Updated last year
- Train Tesseract LSTM with make ☆690 Updated 4 months ago
- Tesseract Open Source OCR Engine (main repository) ☆3,726 Updated last month
- Source training data for Tesseract for lots of languages ☆859 Updated 4 months ago
- Tesseract Open Source OCR Engine (main repository) ☆69,073 Updated 2 weeks ago
- A Python wrapper for Google Tesseract ☆6,204 Updated 2 weeks ago
- OCR engine for all the languages ☆867 Updated last week
- A Python wrapper for the tesseract-ocr API ☆2,113 Updated 3 weeks ago
- Demos, examples and utilities using PyMuPDF ☆677 Updated last year
- Download Poppler binaries packaged for Windows with dependencies ☆900 Updated last week
- A Gtk/Qt front-end to tesseract-ocr. ☆1,807 Updated last week
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model. ☆1,462 Updated last year
- Line based ATR Engine based on OCRopy ☆1,157 Updated 3 months ago
- Various documents related to Tesseract OCR ☆266 Updated 3 years ago
- Links to awesome OCR projects ☆3,036 Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆395 Updated last year
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning. ☆5,299 Updated last week
- Box editor and trainer for Tesseract OCR ☆245 Updated 2 months ago
- mupdf mirror ☆2,282 Updated this week
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. ☆7,897 Updated this week
- ☆146 Updated 5 years ago
- Library used to deskew a scanned document ☆477 Updated last week
- An interactive document scanner built in Python using OpenCV featuring automatic corner detection, image sharpening, and color thresholdi… ☆574 Updated 2 years ago
- Data used for LSTM model training ☆119 Updated last year
- “Dive Into OCR” is a textbook developed by the PaddleOCR community that integrates OCR theory and practice. ☆248 Updated 2 years ago
- Python-based tools for document analysis and OCR ☆3,462 Updated 4 years ago
- A synthetic data generator for text recognition ☆3,550 Updated last year
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip! ☆296 Updated 3 months ago
- ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones … ☆1,303 Updated last year