gkovacs / pdfocrLinks
Adds text to PDF files using the cuneiform OCR software
☆326Updated 4 years ago
Alternatives and similar repositories for pdfocr
Users that are interested in pdfocr are comparing it to the libraries listed below
Sorting:
- A small utility making use of the pypdf library to provide a (somewhat) lighter alternative to pdftk☆293Updated 2 years ago
- Pi Scan is a simple, robust capture appliance for book scanners. It runs on a Raspberry Pi 2.☆282Updated 7 years ago
- A post-processing tool for scanned sheets of paper.☆1,111Updated last year
- OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched☆261Updated 9 years ago
- Bash Script to Scale and Resize PDFs using Ghostscript☆269Updated last year
- a modern, minimalist javascript photo gallery☆252Updated 6 years ago
- web interface for recoll desktop search☆290Updated 5 years ago
- Removes all "Social DRM" from booXtream ePub files☆185Updated 6 years ago
- Python script to do PDF OCR conversion using Tesseract☆376Updated 2 years ago
- Modular workflow assistant for book digitization☆128Updated 9 years ago
- Legacy I, Librarian - collaborative PDF manager. Not maintained, new version is at https://github.com/mkucej/i-librarian-free☆99Updated 2 years ago
- Linux client for Box.com☆284Updated 6 years ago
- Convert Audible .aa files to mp3☆42Updated 9 years ago
- smoothscan is a tool to convert scanned text into a vectorized output form.☆67Updated 12 years ago
- Scripts for data acquisition with paper based surveys☆191Updated 10 months ago
- Create a git repository from the revision history of a document in Google Drive.☆134Updated 7 years ago
- Command-line interface to Amazon Glacier☆614Updated 2 years ago
- Gobby collaborative editor☆581Updated last year
- Generate OpenDocument Presentation (odp) files from markdown☆110Updated 5 months ago
- Using Jekyll to create outputs that can be used as Pandoc inputs. In short - input markdown, output mobi, epub, pdf, and print-ready pdf.…☆248Updated 4 years ago
- FUSE Filesystem 4 Dropbox☆123Updated 6 years ago
- A modern GNU/Linux firewall for GNOME☆389Updated 7 years ago
- An extendible and configurable PDF manipulation layer library written in java.☆531Updated 3 weeks ago
- Semantic filesystem for Linux, with relation reasoner, autotagging plugins and a deduplication service☆317Updated 6 years ago
- pdf watermark removal library for academic papers☆552Updated 5 years ago
- JBIG2 Encoder☆35Updated 6 months ago
- a collection of useful unix commands/scripts/etc.☆66Updated 7 years ago
- PDF to ODT format converter☆97Updated 3 years ago
- OPTical ARchiver - highly compressed 2D barcode for paper or film archiving of digital data☆118Updated 4 years ago
- Humane Heritage - OLD VERSION☆113Updated 5 years ago