jnweiger / pdfcompare
compare two PDF files, write a resulting PDF with highlighted changes
☆57 Updated last week
Alternatives and similar repositories for pdfcompare
Users who are interested in pdfcompare are comparing it to the libraries listed below:
- A PDF comparison utility in Python. ☆509 Updated last year
- PDF Extraction Toolkit ☆42 Updated 5 years ago
- Extract tables from PDF pages. ☆298 Updated 5 years ago
- Wandora is a general purpose information extraction, management and publishing application based on Topic Maps and Java. ☆134 Updated 2 years ago
- web interface for recoll desktop search ☆293 Updated 5 years ago
- OCR for DjVu ☆47 Updated 3 years ago
- Python binding to libpoppler with focus on text extraction ☆97 Updated 4 years ago
- Convert text from PDF to XML. ☆45 Updated 7 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools) ☆31 Updated 2 years ago
- The hOCR Embedded OCR Workflow and Output Format ☆75 Updated last year
- smoothscan is a tool to convert scanned text into a vectorized output form. ☆68 Updated 12 years ago
- A simple viewer and inspection tool for text boxes in PDF documents ☆96 Updated 3 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format ☆46 Updated 10 months ago
- ☆39 Updated 10 years ago
- a utility to extract the title from a PDF file ☆144 Updated 11 months ago
- A more complete example of programming with PDFMiner, which continues where the default documentation stops ☆216 Updated 6 years ago
- Zorba - the NoSQL processor ☆42 Updated 2 years ago
- A post-processing tool for scanned sheets of paper. ☆85 Updated last year
- Converts XML to LaTeX ☆45 Updated 2 weeks ago
- Compare documents using MS Word from the command line. ☆137 Updated last year
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR … ☆67 Updated 2 years ago
- CiteSeerX public repository ☆135 Updated last year
- Docear's desktop version (GPL) ☆304 Updated 3 years ago
- Batch convert PDF files to text under Windows, using several text extraction methods or OCR ☆35 Updated 10 years ago
- Legacy I, Librarian - collaborative PDF manager. Not maintained, new version is at https://github.com/mkucej/i-librarian-free ☆99 Updated 3 years ago
- Index and search PDF files using Apache Lucene and PDF Box ☆43 Updated 3 months ago
- Generates a customizable print-on-demand paperback book cover as a PDF using LaTeX ☆65 Updated 2 years ago
- LocalCopy is a plugin that extends the popular reference manager JabRef. It provides an automatic download feature for preprints from the… ☆27 Updated 14 years ago
- clone of docfetcher from sourceforge ☆63 Updated 12 years ago
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip! ☆303 Updated 8 months ago