gkovacs / pdfocr
Adds text to PDF files using the cuneiform OCR software
☆327 Updated 4 years ago
Alternatives and similar repositories for pdfocr
Users who are interested in pdfocr are comparing it to the libraries listed below.
- A small utility making use of the pypdf library to provide a (somewhat) lighter alternative to pdftk ☆295 Updated 2 years ago
- a modern, minimalist javascript photo gallery ☆252 Updated 6 years ago
- A post-processing tool for scanned sheets of paper. ☆1,122 Updated last year
- OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched ☆261 Updated 9 years ago
- Modular workflow assistant for book digitization ☆130 Updated 9 years ago
- Pi Scan is a simple, robust capture appliance for book scanners. It runs on a Raspberry Pi 2. ☆284 Updated 7 years ago
- Bash Script to Scale and Resize PDFs using Ghostscript ☆272 Updated last year
- Convert Audible .aa files to mp3 ☆42 Updated 9 years ago
- web interface for recoll desktop search ☆290 Updated 5 years ago
- Industry supported, open source PDF/A validation library ☆304 Updated this week
- smoothscan is a tool to convert scanned text into a vectorized output form. ☆67 Updated 12 years ago
- Removes all "Social DRM" from booXtream ePub files ☆184 Updated 7 years ago
- Gobby collaborative editor ☆582 Updated last year
- Ole Tange's personal tools !!! MOVED TO https://gitlab.com/ole.tange/tangetools !!! ☆55 Updated 9 years ago
- Legacy I, Librarian - collaborative PDF manager. Not maintained, new version is at https://github.com/mkucej/i-librarian-free ☆99 Updated 2 years ago
- ZBackup, a versatile deduplicating backup tool ☆840 Updated 3 years ago
- Markdown editor for scientific writing. Batteries included. ☆323 Updated 9 years ago
- Converter from LaTeX to ebook formats (epub, mobi). Using tex4ht and texlua scripts. ☆355 Updated last month
- Create a git repository from the revision history of a document in Google Drive. ☆134 Updated 8 years ago
- Apple's Time Machine fuse read only file system ☆258 Updated last year
- imapfw (IMAP/mail framework) ☆467 Updated 7 years ago
- Scripts for data acquisition with paper based surveys ☆192 Updated 11 months ago
- Semantic filesystem for Linux, with relation reasoner, autotagging plugins and a deduplication service ☆318 Updated 7 years ago
- Deduplicating backup program ☆1,103 Updated 4 years ago
- An extendible and configurable PDF manipulation layer library written in java. ☆532 Updated 3 weeks ago
- Polar devices Python API and CLI. ☆147 Updated 4 years ago
- Command-line interface to Amazon Glacier ☆614 Updated last month
- [DEPRECATED - please use rups instead] RUPS is an abbreviation for Reading and Updating PDF Syntax. RUPS is a tool built on top of iText®… ☆111 Updated 7 years ago
- Generate OpenDocument Presentation (odp) files from markdown ☆110 Updated 6 months ago
- MOVED TO https://gitlab.com/crossref/pdfextract ☆510 Updated 8 years ago