madmaze / pytesseract
A Python wrapper for Google Tesseract
☆6,298 Updated last month
Alternatives and similar repositories for pytesseract
Users who are interested in pytesseract are comparing it to the libraries listed below.
- A Python wrapper for the tesseract-ocr API ☆2,146 Updated last week
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object ☆1,930 Updated last year
- Python-based tools for document analysis and OCR ☆3,467 Updated 4 years ago
- Tesseract Open Source OCR Engine (main repository) ☆72,037 Updated 2 weeks ago
- Python Imaging Library (Fork) ☆13,329 Updated last week
- extract text from any document. no muss. no fuss. ☆4,420 Updated last year
- Links to awesome OCR projects ☆3,077 Updated last year
- Community maintained fork of pdfminer - we fathom PDF ☆6,863 Updated 2 weeks ago
- A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab ☆929 Updated 7 years ago
- Line based ATR Engine based on OCRopy ☆1,179 Updated 8 months ago
- Headless chrome/chromium automation library (unofficial port of puppeteer) ☆3,935 Updated last year
- Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-head… ☆5,158 Updated last week
- A synthetic data generator for text recognition ☆3,630 Updated last year
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. ☆8,881 Updated this week
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and … ☆28,809 Updated last month
- A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files ☆9,745 Updated last week
- Python PDF Parser (Not actively maintained). Check out pdfminer.six. ☆5,303 Updated 3 years ago
- Tesseract Open Source OCR Engine (main repository) ☆4,025 Updated last week
- Tesseract documentation ☆2,277 Updated 2 weeks ago
- A series of convenience functions to make basic image processing operations such as translation, rotation, resizing, skeletonization, and… ☆4,595 Updated last year
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model. ☆1,472 Updated 4 months ago
- Best (most accurate) trained LSTM models. ☆1,488 Updated last year
- Fuzzy String Matching in Python ☆3,547 Updated 10 months ago
- A Python library for reading and writing PDF, powered by QPDF ☆2,608 Updated this week
- Trained models with fast variant of the "best" LSTM models + legacy models ☆7,349 Updated last year
- Faker is a Python package that generates fake data for you. ☆19,030 Updated last week
- The lxml XML toolkit for Python ☆2,984 Updated last week
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The … ☆2,010 Updated last week
- Train Tesseract LSTM with make ☆709 Updated 9 months ago
- MySQL client library for Python ☆7,843 Updated 5 months ago