h / pytesseractLinks
Python-tesseract is an optical character recognition (OCR) tool for python
☆180Updated 3 months ago
Alternatives and similar repositories for pytesseract
Users that are interested in pytesseract are comparing it to the libraries listed below
Sorting:
- Python bindings to PDFium, reasonably cross-platform.☆721Updated this week
- A template repo holding our common setup for a python project☆126Updated last month
- Demos, examples and utilities using PyMuPDF☆706Updated last month
- The official Python Library for the Groq API☆575Updated last week
- Aspose.Words for Python via .NET examples and showcases☆132Updated 3 weeks ago
- img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing☆850Updated 3 months ago
- Detect and read handwritten words on scanned pages.☆136Updated 2 years ago
- OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR☆171Updated last week
- OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.☆124Updated 3 years ago
- Library used to deskew a scanned document☆498Updated this week
- Extract structured text from pdfs quickly☆661Updated 7 months ago
- 📚 Process PDFs, Word documents and more with spaCy☆850Updated 11 months ago
- Ultralytics Notebooks 🚀☆191Updated 3 weeks ago
- Benchmarking PDF libraries☆321Updated 7 months ago
- The best way to use Selenium in Google Colab Notebooks!☆277Updated this week
- A Python client for the Unstructured Platform API☆114Updated this week
- Simple package to extract text with coordinates from programmatic PDFs☆238Updated this week
- AI Bots - Robotic Processing automation Python and Julia lang scripts to support automating repetitive tasks☆93Updated last year
- Train Tesseract LSTM with make☆713Updated 9 months ago
- OCR engine for all the languages☆940Updated last week
- The official Roboflow Python package. Manage your datasets, models, and deployments. Roboflow has everything you need to build a computer…☆533Updated last week
- ☆392Updated 2 years ago
- AssemblyAI's Official Python SDK☆201Updated last week
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,933Updated last year
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆241Updated last year
- Official Python SDK for Deepgram.☆391Updated this week
- Streamlit PDF viewer☆195Updated last week
- ☆28Updated 3 years ago
- Recognition of handwritten text using CRAFT text detection and TrOCR☆26Updated 3 years ago
- ☆114Updated last year