h / pytesseractLinks
Python-tesseract is an optical character recognition (OCR) tool for python
☆179Updated 2 months ago
Alternatives and similar repositories for pytesseract
Users that are interested in pytesseract are comparing it to the libraries listed below
Sorting:
- A template repo holding our common setup for a python project☆123Updated 3 years ago
- Python bindings to PDFium, reasonably cross-platform.☆699Updated last week
- The official Python Library for the Groq API☆567Updated last week
- Demos, examples and utilities using PyMuPDF☆693Updated last year
- git mirror for Beautiful Soup 4.3.2☆197Updated 3 years ago
- OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.☆125Updated 3 years ago
- Benchmarking PDF libraries☆316Updated 6 months ago
- Object Detection Model for Scanned Documents☆93Updated 9 months ago
- ☆389Updated last year
- Library used to deskew a scanned document☆495Updated this week
- Embed ngrok secure ingress into your Python apps with a single line of code.☆174Updated 2 weeks ago
- a Python client library for SerpApi.☆112Updated last month
- A simplistic linear and multiprocessed approach to sentiment analysis using Gzip Normalized Compression Distances with k nearest neighbor…☆144Updated 2 years ago
- Python client for Typesense: https://github.com/typesense/typesense☆230Updated last month
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆235Updated last year
- A Python client for the Unstructured Platform API☆111Updated this week
- OCR engine for all the languages☆927Updated 2 weeks ago
- Updating this repo every week, You may want to STAR it :)☆70Updated last year
- Aspose.Words for Python via .NET examples and showcases☆131Updated 3 weeks ago
- Experiment and integrate with different OCR frameworks seamlessly☆102Updated last year
- AssemblyAI's Official Python SDK☆201Updated 2 weeks ago
- Python client for Qdrant vector search engine☆1,190Updated 2 weeks ago
- Official Python SDK for Deepgram.☆379Updated last week
- The best way to use Selenium in Google Colab Notebooks!☆273Updated 2 months ago
- ☆66Updated 2 years ago
- Extract structured text from pdfs quickly☆643Updated 6 months ago
- 📚 Process PDFs, Word documents and more with spaCy☆832Updated 9 months ago
- OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes☆82Updated 3 months ago
- Detect and read handwritten words on scanned pages.☆134Updated 2 years ago
- ☆147Updated 5 years ago