h / pytesseract
Python-tesseract is an optical character recognition (OCR) tool for Python.
☆105 Updated 6 years ago
Alternatives and similar repositories for pytesseract:
Users who are interested in pytesseract are comparing it to the libraries listed below:
- A template repo holding our common setup for a python project ☆102 Updated 2 years ago
- Detect and read handwritten words on scanned pages. ☆115 Updated last year
- Object Detection Model for Scanned Documents ☆86 Updated last year
- Recognition of handwritten text using CRAFT text detection and TrOCR ☆25 Updated 2 years ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte… ☆187 Updated 3 weeks ago
- Data extraction with Donut ML model ☆56 Updated 5 months ago
- Streamlit component for invoice document labeling ☆56 Updated 2 years ago
- Proceed with text detection only in the selected area of the image ☆175 Updated 11 months ago
- Streamlit PDF viewer ☆121 Updated last week
- Experiment and integrate with different OCR frameworks seamlessly ☆104 Updated 9 months ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation ☆131 Updated 2 weeks ago
- 🖼️ An image select component for Streamlit ☆115 Updated 10 months ago
- Python bindings to PDFium ☆495 Updated this week
- A Python asyncio wrapper for Tesseract-OCR. ☆22 Updated 3 months ago
- NanoNets OCR API Example for Python ☆182 Updated 2 years ago
- multilingual RAG ☆12 Updated 11 months ago
- Benchmarking PDF libraries ☆250 Updated last year
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. ☆26 Updated last year
- Document Layout Analysis ☆359 Updated last week
- Repository for deepdoctection tutorial notebooks ☆40 Updated 2 months ago
- [CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks ☆366 Updated this week
- Read and modify image metadata in Python with exif ☆28 Updated 3 years ago
- Extract structured text from pdfs quickly ☆393 Updated this week
- Access and change cookies from your Streamlit script ☆64 Updated 7 months ago
- Tutorial on how to deskew (straighten) text images ☆51 Updated 2 years ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard. ☆56 Updated this week
- Huggingface utilities for Ultralytics/YOLOv8 ☆83 Updated 11 months ago