A curated list of awesome projects to simplify and improve paper and document scanning.
☆512May 12, 2026Updated last week
Alternatives and similar repositories for awesome-scanning
Users that are interested in awesome-scanning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An expandable and scalable OCR pipeline☆90Nov 14, 2017Updated 8 years ago
- ☆1,789Nov 29, 2020Updated 5 years ago
- A collection of tools for cleaning up book scans.☆148Dec 8, 2022Updated 3 years ago
- A post-processing tool for scanned sheets of paper.☆1,179Jul 11, 2024Updated last year
- Web application for transcribing OCR ground truth from Archive.org☆18Feb 22, 2018Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ScanTailor Universal - a fork based on Enhanced+Featured+Master versions of ST☆255Apr 7, 2026Updated last month
- Android application for scanning and manipulating handwritten notes and documents.☆1,520Apr 23, 2025Updated last year
- Convert scans of handwritten notes to beautiful, compact Images☆15Jun 21, 2022Updated 3 years ago
- A repository for typefaces to train Tesseract and OCRopus for natural history collections and digital humanities.☆9Dec 13, 2014Updated 11 years ago
- Scan Tailor Experimental is an interactive post-processing tool for scanned pages.☆128May 4, 2026Updated 2 weeks ago
- An app store for CLI apps.☆12Apr 3, 2025Updated last year
- Document Scanner that protects your privacy☆1,761Apr 5, 2026Updated last month
- Document Boundary & Canny Edge Detection using OpenCV☆69Oct 12, 2018Updated 7 years ago
- interactive, customizable semantic web visualization☆15Dec 27, 2025Updated 4 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Efficient hOCR tooling☆57Aug 18, 2025Updated 9 months ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Oct 24, 2016Updated 9 years ago
- A list of Japanese Youtube channels with japanese subtitles. So you can easily mine Anki cards with a tool like MPV.☆43Sep 15, 2024Updated last year
- Homebrew formula and App bundler for Scantailor (Advanced)☆181Jan 26, 2026Updated 3 months ago
- A social media open post web archiving tool☆26Feb 4, 2026Updated 3 months ago
- My website & blog with articles about coding, tech, functional programming, …☆10Mar 28, 2026Updated last month
- QA-tool for scans with corresponding ALTO-files☆27Dec 2, 2022Updated 3 years ago
- A platform for teachers and students to share and collaborate on exercises☆10May 14, 2026Updated last week
- Export a JSON archive of a Gitter room's messages☆15Sep 26, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The fake data generator☆18Apr 18, 2026Updated last month
- Ensure that your Johnny Decimal system is neat and clean.☆18Nov 14, 2024Updated last year
- Ansible Playbook to Setup a NAS for backups and media access☆12Aug 4, 2025Updated 9 months ago
- Crop And Splice Segments (of scanned pages)☆14Mar 11, 2019Updated 7 years ago
- Convert Kindle Clippings to Objects that conform with W3C Web Annotation Vocabulary☆14Sep 18, 2017Updated 8 years ago
- Wireshark dissector for the Nix daemon protocol.☆15Sep 11, 2025Updated 8 months ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆139Mar 2, 2026Updated 2 months ago
- A Gated and Bifurcated Stacked U-Net Module for Document Image Dewarping☆109Nov 2, 2022Updated 3 years ago
- Scan, index, and archive all of your paper documents☆7,919Apr 6, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Kexec into an in-memory emergency system☆33Mar 1, 2022Updated 4 years ago
- RObust document image BINarization☆184Aug 2, 2024Updated last year
- 3D-printed spare parts for SGI - Silicon Graphics Computer Systems☆19Feb 14, 2025Updated last year
- Document image dewarping library using a cubic sheet model☆217Updated this week
- low tech iiif annotations via jekyll 📜📝☆13Jun 5, 2024Updated last year
- Language Model and Text Classification for German Language using Deep Learning☆18Jun 15, 2018Updated 7 years ago
- Docker for ScanTailor and ScanTailor Advanced☆14Mar 17, 2024Updated 2 years ago