tesseract-ocr / tessdoc
Tesseract documentation
☆1,901Updated last month
Alternatives and similar repositories for tessdoc:
Users that are interested in tessdoc are comparing it to the libraries listed below
- Best (most accurate) trained LSTM models.☆1,276Updated 10 months ago
- Tesseract Open Source OCR Engine (main repository)☆3,257Updated last month
- Fast integer versions of trained LSTM models☆503Updated 5 months ago
- OCR engine for all the languages☆767Updated this week
- Source training data for Tesseract for lots of languages☆845Updated 10 months ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,683Updated 5 months ago
- Links to awesome OCR projects☆2,866Updated 6 months ago
- Line based ATR Engine based on OCRopy☆1,065Updated 2 months ago
- Train Tesseract LSTM with make☆653Updated 7 months ago
- A Python wrapper for Google Tesseract☆5,971Updated 2 weeks ago
- A Python wrapper for the tesseract-ocr API☆2,042Updated last month
- ☆142Updated 4 years ago
- img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing☆629Updated 2 months ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆375Updated 5 months ago
- ☆912Updated 4 months ago
- A Gtk/Qt front-end to tesseract-ocr.☆1,673Updated last week
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.☆511Updated 3 years ago
- A Python library to extract tabular data from PDFs☆3,108Updated this week
- A Python library for reading and writing PDF, powered by QPDF☆2,234Updated 2 weeks ago
- Box editor and trainer for Tesseract OCR☆234Updated 6 months ago
- Data used for LSTM model training☆116Updated 10 months ago
- Various documents related to Tesseract OCR☆263Updated 3 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆104Updated last year
- Extract tables from scanned image PDFs using Optical Character Recognition.☆271Updated 4 years ago
- Python library to extract tabular data from images and scanned PDFs☆270Updated 5 months ago
- Demos, examples and utilities using PyMuPDF☆612Updated 6 months ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆4,151Updated this week
- ☆940Updated 2 years ago
- Library used to deskew a scanned document☆434Updated last week
- Document Layout Analysis☆359Updated 3 weeks ago