deajan / pmOCRLinks
A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR conversion on file activity
☆65Updated last year
Alternatives and similar repositories for pmOCR
Users that are interested in pmOCR are comparing it to the libraries listed below
Sorting:
- A post-processing tool for scanned sheets of paper.☆82Updated last year
- ReadablePDF streamlines the effort of turning a not so great PDF into a more easily readable PDF (or of course a pretty decent PDF into a…☆33Updated 3 years ago
- Batch convert PDF files to text under Windows, using several text extraction methods or OCR☆33Updated 9 years ago
- Short script for removing watermarks from PDF files. Requires pdftk.☆59Updated 6 years ago
- Tesseract Powered Windows Desktop OCR Application With Multiple Pre/Post Processing GUI☆42Updated last year
- Tool to OCR PDFs using Google Cloud Vision☆42Updated 2 years ago
- Exports XMind Mindmap to any documents with Pandoc.☆32Updated 11 years ago
- A tiny frontend for OCRing PDF files via the web.☆50Updated 5 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆189Updated last month
- Extract meaningful content from pdf and psd file, such as texts and images both linked into a common JSON string☆37Updated 7 years ago
- List of tools for dealing with the wonderful PDF format.☆51Updated 4 years ago
- Export / upload emails from Thunderbird mbox files to single eml files☆23Updated 2 years ago
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!☆290Updated last month
- A repository for LogicalDOC DMS - Community Edition - Docker image https://www.logicaldoc.com/download-logicaldoc-community☆34Updated 3 years ago
- Ergonomic line-by-line transcription of scanned text.☆52Updated 4 years ago
- MOVED TO https://gitlab.com/crossref/pdfmark☆33Updated 6 years ago
- Building scantailor and its dependencies☆58Updated last year
- Convert a PDF via OCR to a TXT file in UTF-8 encoding☆153Updated last year
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR).☆24Updated 10 years ago
- A portable devops tool set on windows, easy customization of cmder/console+msys2/cygwin/wsl☆58Updated 3 years ago
- Easily explore, view and edit markdown documentation of a file tree☆66Updated last year
- A tiny, hackable, two-way cloud synchronisation client for Linux☆55Updated 4 years ago
- Java program to add bookmarks to pdf (stable)☆27Updated 4 years ago
- Prepress preparing tool and PDF editor☆18Updated last year
- Make PDF Files Accessible, Extract Data from PDF, Convert PDF to HTML, Fill-in PDF Form, Stamp PDF and more...☆20Updated 3 months ago
- Simple tools for summarizing .mbox email archives.☆11Updated 5 years ago
- The hOCR Embedded OCR Workflow and Output Format☆73Updated 10 months ago
- Recipes for calibre☆69Updated 11 years ago
- Compare documents using MS Word from the command line.☆132Updated 8 months ago
- Reads HTML files, converting tables into CSV files☆31Updated 5 years ago