deajan / pmOCR
A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR conversion on file activity
☆65Updated last year
Alternatives and similar repositories for pmOCR:
Users that are interested in pmOCR are comparing it to the libraries listed below
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated this week
- Ergonomic line-by-line transcription of scanned text.☆51Updated 4 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆187Updated last month
- A tiny frontend for OCRing PDF files via the web.☆47Updated 5 years ago
- A post-processing tool for scanned sheets of paper.☆80Updated last year
- Convert a PDF via OCR to a TXT file in UTF-8 encoding