madmaze / pytesseract
A Python wrapper for Google Tesseract
☆6,081Updated 3 weeks ago
Alternatives and similar repositories for pytesseract:
Users that are interested in pytesseract are comparing it to the libraries listed below
- A Python wrapper for the tesseract-ocr API☆2,084Updated 2 months ago
- Trained models with fast variant of the "best" LSTM models + legacy models☆6,856Updated last year
- Community maintained fork of pdfminer - we fathom PDF☆6,369Updated last week
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆6,965Updated this week
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆26,348Updated 6 months ago
- Python PDF Parser (Not actively maintained). Check out pdfminer.six.☆5,286Updated 2 years ago
- Python-based tools for document analysis and OCR☆3,448Updated 3 years ago
- extract text from any document. no muss. no fuss.☆4,072Updated 4 months ago
- Headless chrome/chromium automation library (unofficial port of puppeteer)☆3,833Updated 9 months ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,760Updated 8 months ago
- Tesseract Open Source OCR Engine (main repository)☆3,475Updated 5 months ago
- Best (most accurate) trained LSTM models.☆1,329Updated last year
- A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab☆928Updated 6 years ago
- Python Imaging Library (Fork)☆12,722Updated this week
- Tesseract Open Source OCR Engine (main repository)☆66,229Updated 3 weeks ago
- Tesseract documentation☆2,005Updated 2 months ago
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆1,890Updated 3 weeks ago
- Links to awesome OCR projects☆2,954Updated 9 months ago
- A library for converting HTML into PDFs using ReportLab☆2,296Updated last month
- A python wrapper for libmagic☆2,742Updated last month
- The ctypes-based simple ImageMagick binding for Python☆1,440Updated 2 weeks ago
- Wkhtmltopdf python wrapper to convert html to pdf☆2,016Updated last year
- Freeze (package) Python programs into stand-alone executables☆12,292Updated this week
- The lxml XML toolkit for Python☆2,808Updated this week
- A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files☆8,952Updated this week
- A Python library for reading and writing PDF, powered by QPDF☆2,320Updated this week
- Source training data for Tesseract for lots of languages☆853Updated 3 weeks ago
- Simple PDF generation for Python (FPDF PHP port)☆875Updated 8 months ago
- Simple job queues for Python☆10,119Updated this week
- a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb☆11,489Updated 2 weeks ago