h / pytesseract
Python-tesseract is an optical character recognition (OCR) tool for Python.
☆154 Updated 7 years ago
Alternatives and similar repositories for pytesseract
Users who are interested in pytesseract are comparing it to the libraries listed below.
- OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF. ☆110 Updated 2 years ago
- Detect and read handwritten words on scanned pages. ☆124 Updated 2 years ago
- OCR engine for all the languages ☆852 Updated last week
- Library used to deskew a scanned document ☆473 Updated last week
- ☆372 Updated last year
- Python bindings to PDFium, reasonably cross-platform. ☆597 Updated this week
- img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing ☆769 Updated 5 months ago
- The official Python Library for the Groq API ☆528 Updated last week
- The official Roboflow Python package. Manage your datasets, models, and deployments. Roboflow has everything you need to build a computer… ☆434 Updated this week
- Train Tesseract LSTM with make ☆687 Updated 3 months ago
- A template repo holding our common setup for a python project ☆108 Updated 2 years ago
- Aspose.Words for Python via .NET examples and showcases ☆124 Updated last week
- Benchmarking PDF libraries ☆297 Updated 3 weeks ago
- Demos, examples and utilities using PyMuPDF ☆670 Updated last year
- RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDF ☆987 Updated this week
- Checkbox Detection Model for Scanned Documents ☆76 Updated 4 months ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object ☆1,832 Updated last year
- Proceed with text detection only in the selected area of the image ☆224 Updated last year
- Ultralytics Notebooks 🚀 ☆93 Updated this week
- 📚 Process PDFs, Word documents and more with spaCy ☆680 Updated 4 months ago
- ☆66 Updated 2 years ago
- Robust and Straight-Forward solution for reading difficult and tricky QR codes within images in Python. Powered by YOLOv8 ☆305 Updated 5 months ago
- Streamlit PDF viewer ☆162 Updated 3 weeks ago
- Updating this repo every week, You may want to STAR it :) ☆69 Updated 11 months ago
- The best way to use Selenium in Google Colab Notebooks! ☆249 Updated 3 weeks ago
- Extract structured text from pdfs quickly ☆514 Updated last month
- Official Python client library for LinkedIn APIs ☆215 Updated last year
- Machine Learning Training Utilities (for TensorFlow and PyTorch) ☆244 Updated 3 months ago
- A python library to make filling pdfs much easier ☆152 Updated 11 months ago
- A web utility to draw polygons and retrieve their coordinates for computer vision applications. ☆78 Updated 9 months ago