jnweiger / pdfcompare
compare two PDF files, write a resulting PDF with highlighted changes
☆56Updated 6 months ago
Alternatives and similar repositories for pdfcompare:
Users that are interested in pdfcompare are comparing it to the libraries listed below
- smoothscan is a tool to convert scanned text into a vectorized output form.☆67Updated 11 years ago
- PDF Extraction Toolkit☆41Updated 4 years ago
- Zotero Word for Windows integration☆52Updated this week
- OCR for DjVu☆47Updated 2 years ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 3 years ago
- ☆25Updated last year
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras…☆11Updated 6 years ago
- A visual tool for cropping pdf files☆42Updated 6 years ago
- MathWebSearch Implementation☆47Updated 2 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated 10 months ago
- Extract meaningful content from pdf and psd file, such as texts and images both linked into a common JSON string☆37Updated 7 years ago
- The hOCR Embedded OCR Workflow and Output Format☆74Updated 6 months ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆107Updated 6 months ago
- Updating the oberdiek bundle☆17Updated 6 months ago
- A selection of test lines of several early printed books as well as the corresponding individual OCRopus models and mixed models.☆10Updated 7 years ago
- A LibreOffice extension that converts JabRef references to plain text code and vice versa so that you can use your references with MS Off…☆11Updated 6 months ago
- ☆10Updated 5 years ago
- Desktop Version of Docuburst☆19Updated 8 years ago
- The CIS OCR PostCorrectionTool☆41Updated 2 years ago
- Building scantailor and its dependencies☆58Updated last year
- ScanTailor Universal - a fork based on Enhanced+Featured+Master versions of ST☆197Updated 5 months ago
- ON HIATUS. Mostly-complete LaTeX engine implemented fully in JavaScript.☆33Updated 9 years ago
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆22Updated 4 years ago
- Wrapper around pixel classifier☆9Updated 2 years ago
- Extracts highlighted text from PDF documents.☆31Updated 7 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated 11 months ago
- BibSync is a tool to synchronize scientific papers and bibtex bibliography files☆60Updated 10 years ago
- User contributed (non Google) OCR models for Tesseract☆24Updated 3 months ago
- A script to batch rename PDF files based on metadata/XMP title and author☆116Updated 5 years ago
- Punk Nova - an OpenType implementation of Donald Knuth's Punk font☆44Updated 2 years ago