h / pytesseract
Python-tesseract is an optical character recognition (OCR) tool for python
☆72Updated 6 years ago
Related projects: ⓘ
- Run OCR, extract information from documents and classify them. In addition, annotate documents and build custom NLP and computer vision m…☆60Updated this week
- A template repo holding our common setup for a python project☆82Updated last year
- Experiment and integrate with different OCR frameworks seamlessly☆104Updated 5 months ago
- Some information for working with the Together inference API for Open Source AI models☆57Updated 8 months ago
- Object Detection Model for Scanned Documents☆77Updated 11 months ago
- Python bindings to PDFium☆349Updated this week
- Recognition of handwritten text using CRAFT text detection and TrOCR☆24Updated last year
- Detect and read handwritten words on scanned pages.☆99Updated last year
- OCRmyPDF EasyOCR plugin☆44Updated 3 weeks ago
- Unsloth Studio☆21Updated last week
- Library used to deskew a scanned document☆407Updated 2 weeks ago
- Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector☆252Updated 2 years ago
- Streamly - Streamlit Assistant is designed to provide the latest updates from Streamlit, generate code snippets for Streamlit widgets, an…☆58Updated last month
- ☆56Updated 11 months ago
- ☆181Updated 3 months ago
- Integrate AI-powered Document Analysis Pipelines☆58Updated last week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆58Updated 2 weeks ago
- An integration of Qdrant ANN vector database backend with Haystack☆42Updated 2 months ago
- Serverless OpenCV image manipulation app with Streamlit☆31Updated 3 months ago
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆70Updated 4 months ago
- A Python client for the Unstructured hosted API☆74Updated this week
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆15Updated 7 months ago
- FastServer is simple API based server created in FastApi with Clients in various languages.☆24Updated last year
- ☆18Updated 6 months ago
- Prototype app enabling job description search using natural language description of a job seeker.☆59Updated 3 months ago
- A Streamlit wrapper component on react-smooth-dnd☆15Updated 7 months ago
- Document Layout Analysis☆335Updated this week
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆120Updated 2 months ago
- Python library to extract tabular data from images and scanned PDFs☆255Updated last month
- ☆59Updated 3 weeks ago