jnweiger / pdfcompare
compare two PDF files, write a resulting PDF with highlighted changes
☆56Updated 9 months ago
Alternatives and similar repositories for pdfcompare
Users that are interested in pdfcompare are comparing it to the libraries listed below
Sorting:
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated last month
- ConfrontaPDF compares PDF files, GUI or command line☆14Updated 3 years ago
- BibSync is a tool to synchronize scientific papers and bibtex bibliography files☆60Updated 11 years ago
- OCR for DjVu☆48Updated 2 years ago
- Convert text from PDF to XML.☆45Updated 6 years ago
- A PDF comparison utility in Python.☆474Updated 5 months ago
- PDF Table Extractor - repository to hold revisable version of code from https://www.cvast.tuwien.ac.at/projects/pdf2table by Burcu Yildiz☆38Updated last year
- Automatic de-keystoning for single camera DIY book scanners☆22Updated 9 years ago
- A selection of test lines of several early printed books as well as the corresponding individual OCRopus models and mixed models.☆10Updated 7 years ago
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR).☆23Updated 10 years ago
- The hOCR Embedded OCR Workflow and Output Format☆74Updated 9 months ago
- Repository of PyX, a Python package for the creation of PostScript, PDF, and SVG files.☆120Updated last year
- ☆38Updated 9 years ago
- A toolset for handwriting recognition☆70Updated 2 years ago
- Automatically exported from code.google.com/p/tikzedt☆31Updated 8 years ago
- A step-by-step C# implementation of the Docstrum algorithm☆23Updated 4 years ago
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆65Updated last year
- PDF to XML ALTO file converter☆238Updated this week
- PDF Command Line Tools Source☆247Updated last week
- A library for extracting tables from PDF files☆89Updated 4 years ago
- A scientific document recognition system☆170Updated 2 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools)☆31Updated last year
- An efficient data structure for fast string similarity searches☆22Updated 4 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆188Updated last week
- Issues related to the tagging project☆54Updated last month
- IPython extension for rendering and displaying asymptote in an IPython notebook.☆41Updated 5 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated last year
- The LaTeX source files for the text Fundamentals of Matrix Algebra☆29Updated 11 years ago
- ☆69Updated 7 years ago
- Updating the oberdiek bundle☆17Updated 9 months ago