johndoe31415 / pdfminifyLinks
PDF minifier that allows removing duplicate data, re-compresses images, creation of PDF/A-1b and digital PDF signing
☆55Updated last year
Alternatives and similar repositories for pdfminify
Users that are interested in pdfminify are comparing it to the libraries listed below
Sorting:
- ScanTailor Universal - a fork based on Enhanced+Featured+Master versions of ST☆228Updated 3 weeks ago
- Code repository for PDFStitcher, a utility to stitch together and modify line properties of PDF sewing patterns.☆169Updated this week
- A general purpose PDF text-layer redaction tool for Python 2/3.☆206Updated last year
- ☆17Updated 3 years ago
- A post-processing tool for scanned sheets of paper.☆85Updated last year
- Building scantailor and its dependencies☆64Updated 2 years ago
- A post-processing tool for scanned sheets of paper.☆1,130Updated last year
- Get semantic HTML from PDFs, recover lost text, tables, data... in bulk.☆36Updated last year
- Create beautiful documents with data. Open source pdf (and Scribus) template and mail-merge alternative.☆278Updated 11 months ago
- Automatically crop and rotate scanned images using OpenCV☆123Updated 2 years ago
- ☆20Updated 11 months ago
- RUPS is an acronym for Reading and Updating PDF Syntax. RUPS is a tool built on top of iText® that allows you to look inside a PDF docume…☆340Updated this week
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!☆299Updated 6 months ago
- A low-level PDF creator☆138Updated last week
- A tiny CSS parser☆181Updated 3 weeks ago
- JBIG2 Encoder☆44Updated 9 months ago
- API for OpenDocument in Python☆345Updated 3 months ago
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆67Updated last year
- Python bindings for lib7zip☆35Updated 5 years ago
- A cross-platform utility to join, split, stamp, and rotate PDFs written in Python. Yes, Python!☆39Updated 2 years ago
- Pack a webpage including images and CSS into a single HTML file☆91Updated 4 years ago
- Python scanning library for cross platform environment based on SANE and TWAIN☆26Updated 9 years ago
- Advanced Duplicate File Finder for Python☆80Updated 5 years ago
- A free Windows graphical interface to the Tesseract 4.0 OCR engine.☆61Updated 3 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆404Updated last year
- Standalone reStructuredText editor with live preview (native app)☆55Updated 2 years ago
- Safe and fast evaluation of untrusted user-supplied python expressions☆35Updated 2 weeks ago
- Repository with python scripts from the Scribus community☆46Updated last month
- Python interface to libarchive☆80Updated 6 months ago
- Tesseract Powered Windows Desktop OCR Application With Multiple Pre/Post Processing GUI☆41Updated last year