madmaze / pytesseract
A Python wrapper for Google Tesseract
☆5,971 Updated 2 weeks ago
Alternatives and similar repositories for pytesseract:
Users who are interested in pytesseract are comparing it to the libraries listed below:
- A Python wrapper for the tesseract-ocr API ☆2,042 Updated last month
- A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab ☆931 Updated 6 years ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object ☆1,683 Updated 5 months ago
- Trained models with fast variant of the "best" LSTM models + legacy models ☆6,595 Updated 10 months ago
- Python job scheduling for humans. ☆11,934 Updated 7 months ago
- Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame ☆2,218 Updated last month
- Python-based tools for document analysis and OCR ☆3,432 Updated 3 years ago
- extract text from any document. no muss. no fuss. ☆3,956 Updated last month
- Tesseract Open Source OCR Engine (main repository) ☆63,831 Updated last week
- Best (most accurate) trained LSTM models. ☆1,276 Updated 10 months ago
- A Python library for reading and writing PDF, powered by QPDF ☆2,234 Updated 2 weeks ago
- Tesseract Open Source OCR Engine (main repository) ☆3,257 Updated last month
- Python PDF Parser (Not actively maintained). Check out pdfminer.six. ☆5,275 Updated 2 years ago
- Requests + Gevent = <3 ☆4,508 Updated 5 months ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs: ☆14,281 Updated 5 months ago
- Source training data for Tesseract for lots of languages ☆845 Updated 10 months ago
- 🪼 a python library for doing approximate and phonetic matching of strings. ☆2,083 Updated 2 weeks ago
- Extract Keywords from sentence or Replace keywords in sentences. ☆5,606 Updated 6 months ago
- Camelot: PDF Table Extraction for Humans ☆3,679 Updated 2 years ago
- Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-fa… ☆7,807 Updated 2 months ago
- Asynchronous HTTP client/server framework for asyncio and Python ☆15,322 Updated this week
- Retrying library for Python ☆6,936 Updated 2 months ago
- Pythonic HTML Parsing for Humans™ ☆13,770 Updated 9 months ago
- The lxml XML toolkit for Python ☆2,745 Updated this week
- Useful extensions to the standard Python datetime features ☆2,404 Updated 5 months ago
- Python Data. Leaflet.js Maps. ☆6,995 Updated last week
- A next generation HTTP client for Python. 🦋 ☆13,573 Updated this week
- A Python library to extract tabular data from PDFs ☆3,108 Updated this week
- 🏹 Better dates & times for Python ☆8,768 Updated last month
- A Python library for automating interaction with websites. ☆4,694 Updated 2 months ago