h / pytesseract
Python-tesseract is an optical character recognition (OCR) tool for Python
☆156 Updated 7 years ago
Alternatives and similar repositories for pytesseract
Users who are interested in pytesseract are comparing it to the libraries listed below.
- The official Python Library for the Groq API ☆540 Updated this week
- Python bindings to PDFium, reasonably cross-platform. ☆608 Updated this week
- Demos, examples and utilities using PyMuPDF ☆676 Updated last year
- Library used to deskew a scanned document ☆475 Updated 2 weeks ago
- Benchmarking PDF libraries ☆304 Updated last month
- OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF. ☆110 Updated 2 years ago
- ☆374 Updated last year
- Official Python SDK for Deepgram. ☆335 Updated last week
- Recognition of handwritten text using CRAFT text detection and TrOCR ☆26 Updated 2 years ago
- The official Roboflow Python package. Manage your datasets, models, and deployments. Roboflow has everything you need to build a computer… ☆447 Updated this week
- An end-to-end signature verification system to extract, clean and verify signatures in documents. Signatures are detected using YOLOv5. N… ☆184 Updated last year
- img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing ☆778 Updated this week
- Machine Learning Training Utilities (for TensorFlow and PyTorch) ☆244 Updated 4 months ago
- Aspose.Words for Python via .NET examples and showcases ☆124 Updated this week
- Official Python client library for LinkedIn APIs ☆217 Updated last year
- Detect and read handwritten words on scanned pages. ☆125 Updated 2 years ago
- Streamlit PDF viewer ☆169 Updated last month
- This is a project that translates a .pdf file, preserving the original layout of that .pdf file. [UPDATED] We have achieved the Second Pr… ☆101 Updated 9 months ago
- Extract structured text from pdfs quickly ☆563 Updated 2 months ago
- ☆66 Updated 2 years ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte… ☆211 Updated 7 months ago
- Document image dewarping library using a cubic sheet model ☆167 Updated this week
- A template repo holding our common setup for a python project ☆117 Updated 2 years ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object ☆1,841 Updated last year
- The best way to use Selenium in Google Colab Notebooks! ☆250 Updated last month
- OCR engine for all the languages ☆863 Updated this week
- Proceed with text detection only in the selected area of the image ☆229 Updated last year
- Object Detection Model for Scanned Documents ☆94 Updated 5 months ago
- Ultralytics Notebooks 🚀 ☆101 Updated this week
- Robust QR Detector based on YOLOv8 ☆147 Updated 5 months ago