h / pytesseract
Python-tesseract is an optical character recognition (OCR) tool for Python.
☆169 Updated 7 years ago
Alternatives and similar repositories for pytesseract
Users who are interested in pytesseract are comparing it to the libraries listed below.
- The official Python Library for the Groq API ☆549 Updated last week
- Aspose.Words for Python via .NET examples and showcases ☆126 Updated 3 weeks ago
- A template repo holding our common setup for a python project ☆121 Updated 2 years ago
- Official Python SDK for Deepgram. ☆349 Updated this week
- Python bindings to PDFium, reasonably cross-platform. ☆647 Updated last week
- AssemblyAI's Official Python SDK ☆194 Updated 2 weeks ago
- The official Roboflow Python package. Manage your datasets, models, and deployments. Roboflow has everything you need to build a computer… ☆476 Updated last week
- OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF. ☆116 Updated 2 years ago
- img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing ☆792 Updated last month
- Ultralytics Notebooks 🚀 ☆111 Updated last month
- A simple tool for automatic image annotation using Roboflow API ☆46 Updated 2 years ago
- Demos, examples and utilities using PyMuPDF ☆683 Updated last year
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object ☆1,870 Updated last year
- Streamlit PDF viewer ☆177 Updated 2 weeks ago
- ☆136 Updated 2 weeks ago
- An efficient OCR engine for receipt image processing. ☆132 Updated last week
- ☆113 Updated 10 months ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte… ☆218 Updated 9 months ago
- A Python client library for SerpApi. ☆97 Updated 2 weeks ago
- Python client library for Mistral AI platform ☆653 Updated last week
- A Python client for the Unstructured Platform API ☆107 Updated this week
- Library used to deskew a scanned document ☆485 Updated this week
- Extract structured text from pdfs quickly ☆605 Updated 3 months ago
- The best way to use Selenium in Google Colab Notebooks! ☆260 Updated 3 months ago
- OnnxTR, a docTR (Document Text Recognition) library ONNX pipeline wrapper, for seamless, high-performing & accessible OCR ☆154 Updated 3 weeks ago
- CPU compatible fork of the official SAMv2 implementation aimed at more accessible and documented tutorials ☆79 Updated last month
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid… ☆21 Updated 11 months ago
- Repository for deepdoctection tutorial notebooks ☆46 Updated 3 months ago
- A Python asyncio wrapper for Tesseract-OCR. ☆26 Updated last week
- Python Library for Accessing the Cohere API ☆363 Updated last week