endalk200 / document-scanner
Document scanner written in Python using OpenCV and other computer vision libraries. It takes an image of a document and produces a scanned version by running image manipulations on it.
☆31Updated 11 months ago
Alternatives and similar repositories for document-scanner
Users who are interested in document-scanner are comparing it to the libraries listed below.
- A line-based framework to detect and extract tabular data in JSON format from raster images using computer vision and Tesseract OCR.☆59Updated 3 months ago
- Docscan is a document scanner. Take a photo of your documents and frame it.☆105Updated last year
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆80Updated 2 weeks ago
- OnnxTR: an ONNX pipeline wrapper for the docTR (Document Text Recognition) library, for seamless, high-performing and accessible OCR☆167Updated last month
- Pipeline for converting PDFs to raw text with PaddleOCR☆23Updated 2 years ago
- A Python tool to help extract information from structured PDFs.☆427Updated last month
- Library used to deskew a scanned document☆497Updated this week
- Aspose.Words for Python via .NET examples and showcases☆131Updated this week
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆154Updated 8 months ago
- A simple Python wrapper for PDFium.☆17Updated 4 years ago
- ☆10Updated 5 years ago
- ☆40Updated 5 years ago
- CRUD Word documents with Python☆13Updated last month
- ☆28Updated 3 years ago
- Tutorial on how to deskew (straighten) text images☆52Updated 3 years ago
- Checkbox Detection Model for Scanned Documents☆90Updated 10 months ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆237Updated last year
- Python library to extract tabular data from images and scanned PDFs☆285Updated last year
- An interactive document scanner built in Python using OpenCV featuring automatic corner detection, image sharpening, and color thresholdi…☆596Updated 3 years ago
- A carefully designed OCR pipeline for universal bordered table recognition and reconstruction.☆178Updated 3 years ago
- ☆66Updated 2 years ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆226Updated last month
- Robust and straightforward solution for reading difficult and tricky QR codes within images in Python. Powered by YOLOv8☆379Updated 10 months ago
- Small microservice to handle authentication and rights☆19Updated 2 years ago
- Document image dewarping library using a cubic sheet model☆194Updated this week
- A Python asyncio wrapper for Tesseract-OCR.☆27Updated this week
- Extract tables from scanned PDF documents into CSV files using OCR and image processing☆141Updated 6 years ago
- Tools for extracting figures, tables, text, etc. from a PDF document.☆34Updated 5 years ago
- Object Detection Model for Scanned Documents☆93Updated 10 months ago
- This is a code generator that allows you to use a SQLAlchemy schema to quickly build a FastAPI project template with CRUD API routers and v…☆39Updated 2 years ago