deajan / pmOCR
A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR conversion on file activity
☆66Updated last year
Alternatives and similar repositories for pmOCR:
Users that are interested in pmOCR are comparing it to the libraries listed below
- Tesseract Powered Windows Desktop OCR Application With Multiple Pre/Post Processing GUI☆41Updated 9 months ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆183Updated 3 months ago
- Batch processing helper – GUI – for “ScanTailor-CLI” -- created by Csaba Kovacs☆12Updated 8 years ago
- Ergonomic line-by-line transcription of scanned text.☆50Updated 4 years ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆36Updated this week
- A post-processing tool for scanned sheets of paper.☆77Updated 10 months ago
- Prepress preparing tool and PDF editor☆17Updated last year
- The hOCR Embedded OCR Workflow and Output Format☆73Updated 5 months ago
- Short script for removing watermarks from PDF files. Requires pdftk.☆58Updated 5 years ago
- TagSpaces Web Clipper for Chrome and Firefox☆40Updated 2 weeks ago
- Building scantailor and its dependencies☆57Updated last year
- Integrate programs or scripts into common tools like Windows Explorer context menu☆23Updated 6 months ago
- The CIS OCR PostCorrectionTool☆40Updated 2 years ago
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!☆279Updated 11 months ago
- Efficient hOCR tooling☆42Updated 4 months ago
- Batch convert PDF files to text under Windows, using several text extraction methods or OCR☆33Updated 9 years ago
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆22Updated 4 years ago
- Make PDF Files Accessible, Extract Data from PDF, Convert PDF to HTML, Fill-in PDF Form, Stamp PDF and more...☆20Updated this week
- Simple tools for summarizing .mbox email archives.☆11Updated 4 years ago
- BulkPDF is a free and easy to use open source software, which allows to automatically fill an existing PDF form with differen values. Onl…☆129Updated 3 months ago
- List of tools for dealing with the wonderful PDF format.☆46Updated 4 years ago
- PAGE XML format collection for document image page content and more☆67Updated 3 years ago
- Make your PDF files text-searchable (A GUI for OCRmyPDF)☆35Updated 6 months ago
- Export / upload emails from Thunderbird mbox files to single eml files☆21Updated last year
- Fess Site Search provides JavaScript files.☆23Updated 3 weeks ago
- A free Windows graphical interface to the Tesseract 4.0 OCR engine.☆57Updated 2 years ago
- OCR for DjVu☆47Updated 2 years ago
- Graphical User Interface for factur-x library with basic functionalities☆24Updated 5 years ago
- OCR evaluation brought to you by University of Alicante☆67Updated 2 years ago