ad-si / awesome-scanningLinks
A curated list of awesome projects to simplify and improve paper and document scanning.
☆468Updated 4 months ago
Alternatives and similar repositories for awesome-scanning
Users that are interested in awesome-scanning are comparing it to the libraries listed below
Sorting:
- A post-processing tool for scanned sheets of paper.☆1,122Updated last year
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!☆298Updated 5 months ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆403Updated last year
- ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones …☆255Updated 4 months ago
- ☆1,732Updated 4 years ago
- ScanTailor Universal - a fork based on Enhanced+Featured+Master versions of ST☆224Updated this week
- A post-processing tool for scanned sheets of paper.☆85Updated last year
- OCR engine for all the languages☆905Updated this week
- Document image dewarping library using a cubic sheet model☆180Updated last week
- Automatic de-keystoning for single camera DIY book scanners.☆49Updated 5 years ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆125Updated 2 months ago
- A collection of tools for cleaning up book scans.☆144Updated 2 years ago
- ☆984Updated last year
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆110Updated 2 years ago
- Convert a PDF via OCR to a TXT file in UTF-8 encoding☆152Updated 2 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆193Updated 4 months ago
- Web based JavaScript GUI library for proofreading/editing hOCR☆100Updated 7 years ago
- Scripts and results from our OCR roundup, available on Source☆150Updated 6 years ago
- Apply different text recognition services to images of handwritten documents.☆187Updated 2 years ago
- Textricator is a tool to extract text from documents and generate structured data.☆350Updated 8 months ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆196Updated 5 months ago
- An interactive document scanner built in Python using OpenCV featuring automatic corner detection, image sharpening, and color thresholdi…☆584Updated 3 years ago
- Library used to deskew a scanned document☆491Updated last week
- Perspective recovery of text using transformed ellipses☆150Updated 4 years ago
- Local adaptive image binarization☆126Updated 2 years ago
- Collection of OCR-related python tools and wrappers from @OCR-D☆131Updated 3 weeks ago
- An application of high resolution GANs to dewarp images of perturbed documents☆148Updated 4 years ago
- Working with hOCR in Javascript☆136Updated 2 years ago
- Line based ATR Engine based on OCRopy☆1,168Updated 6 months ago
- Document Layout Analysis☆391Updated last week