gkovacs / pdfocr
Adds text to PDF files using the cuneiform OCR software
☆329Updated 4 years ago
Alternatives and similar repositories for pdfocr
Users that are interested in pdfocr are comparing it to the libraries listed below
- A small utility making use of the pypdf library to provide a (somewhat) lighter alternative to pdftk☆296Updated 2 years ago
- Extract tables from PDF files☆359Updated 9 years ago
- Convert Audible .aa files to mp3☆42Updated 10 years ago
- a modern, minimalist javascript photo gallery☆252Updated 6 years ago
- Legacy I, Librarian - collaborative PDF manager. Not maintained, new version is at https://github.com/mkucej/i-librarian-free☆99Updated 3 years ago
- Python script to do PDF OCR conversion using Tesseract☆375Updated 2 years ago
- Modular workflow assistant for book digitization☆131Updated 9 years ago
- An extendible and configurable PDF manipulation layer library written in java.☆537Updated last week
- PDF to ODT format converter☆97Updated 3 years ago
- Removes all "Social DRM" from booXtream ePub files☆187Updated 7 years ago
- imapfw (IMAP/mail framework)☆467Updated 7 years ago
- A modern GNU/Linux firewall for GNOME☆387Updated 7 years ago
- A presenter console with multi-monitor support for PDF files.☆211Updated 10 years ago
- Gobby collaborative editor☆584Updated 2 years ago
- Semantic filesystem for Linux, with relation reasoner, autotagging plugins and a deduplication service☆324Updated 7 years ago
- Deduplicating backup program☆1,101Updated 4 years ago
- [DEPRECATED - please use rups instead] RUPS is an abbreviation for Reading and Updating PDF Syntax. RUPS is a tool built on top of iText®…☆111Updated 7 years ago
- Legacy Python 2 version of AutoKey, the desktop automation utility for Linux and X11.☆207Updated 6 years ago
- a set of tools to help with securely redacting and stripping metadata from documents before publishing☆550Updated 5 years ago
- ZBackup, a versatile deduplicating backup tool☆842Updated 3 years ago
- A simple converter from OpenDocument Text to plain text☆88Updated 7 years ago
- pdf watermark removal library for academic papers☆556Updated 5 years ago
- Enables common unix utilities like cut, awk, wc, head to work correctly with csv data containing delimiters and newlines☆451Updated 2 years ago
- Store and restore metadata from a filesystem.☆178Updated 2 years ago
- random files I've been collecting & programming☆246Updated 4 months ago
- OPTical ARchiver - highly compressed 2D barcode for paper or film archiving of digital data☆121Updated 4 years ago
- MOVED TO https://gitlab.com/crossref/pdfextract☆510Updated 8 years ago
- Email::Message Perl module for reading Outlook .msg files☆193Updated 3 weeks ago
- Apple's Time Machine fuse read only file system☆259Updated last year
- Scripts for data acquisition with paper based surveys☆195Updated last year