navchandar / look-like-scannedLinks
Python package to make documents look like they were scanned
☆45Updated last week
Alternatives and similar repositories for look-like-scanned
Users that are interested in look-like-scanned are comparing it to the libraries listed below
Sorting:
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!☆291Updated last month
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆106Updated 2 years ago
- PDF417 Decoder available in Python☆63Updated 3 months ago
- OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR☆135Updated 3 weeks ago
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆323Updated last year
- ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for us…☆59Updated 4 months ago
- Detect and read handwritten words on scanned pages.☆122Updated last year
- OCRmyPDF EasyOCR plugin☆86Updated 3 months ago
- Google Colab Demo of CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents☆47Updated 3 years ago
- Extracts iframes or keyframes from a video file, through the command line or from inside python.☆17Updated 2 years ago
- Awesome list dedicated to Windows Subsystem for Linux☆24Updated last year
- Library used to deskew a scanned document☆473Updated last week
- A deep learning toolkit specialized for handwritten document analysis☆240Updated 10 months ago
- The Image Comments Visual Studio Code extension lets you easily add visual comments such as sketches or diagrams directly into your sourc…☆9Updated last week
- Document image dewarping library using a cubic sheet model☆162Updated last week
- CLI tool to extract (meta)data from PDF and manipulate PDF files☆158Updated 2 weeks ago
- Convert a PDF via OCR to a TXT file in UTF-8 encoding☆153Updated last year
- OCR engine for all the languages☆849Updated last week
- 📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs☆69Updated last year
- A curated list of resources around PDF files☆135Updated 11 months ago
- BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on sc…☆110Updated 2 years ago
- This repository contains a notebook to demonstrate the power of Document Text Recognition (DocTR) library☆13Updated 3 years ago
- A line-based framework to detect and extract tabular data in JSON format from raster images using computer vision and Tesseract OCR.☆58Updated last year
- A function that takes as input a cropped text line image, and outputs the dewarped image.☆18Updated 8 months ago
- Pretrained mixed models to be used with Calamari.☆63Updated 9 months ago
- Repository for the EM German Model☆110Updated last year
- DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation☆121Updated last year
- ☆46Updated 2 years ago
- Document scanner written in python using OpenCV and other Computer Vision libraries. Scans image of documents and creates scanned version…☆30Updated 5 months ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆141Updated 2 months ago