deajan / pmOCRLinks
A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR conversion on file activity
☆65Updated last year
Alternatives and similar repositories for pmOCR
Users that are interested in pmOCR are comparing it to the libraries listed below
Sorting:
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!☆295Updated 2 months ago
- A free Windows graphical interface to the Tesseract 4.0 OCR engine.☆58Updated 3 years ago
- web interface for recoll desktop search☆290Updated 5 years ago
- ReadablePDF streamlines the effort of turning a not so great PDF into a more easily readable PDF (or of course a pretty decent PDF into a…☆33Updated 3 years ago
- Building scantailor and its dependencies☆60Updated 2 years ago
- Textricator is a tool to extract text from documents and generate structured data.☆348Updated 5 months ago
- Short script for removing watermarks from PDF files. Requires pdftk.☆59Updated 6 years ago
- A post-processing tool for scanned sheets of paper.☆82Updated last year
- Convert a PDF via OCR to a TXT file in UTF-8 encoding☆153Updated last year
- Tesseract Powered Windows Desktop OCR Application With Multiple Pre/Post Processing GUI☆42Updated last year
- 📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs☆70Updated last year
- A chrome extension for automatically save the visited pages and the downloaded URLs in your bookmarks.☆16Updated 9 years ago
- Export / upload emails from Thunderbird mbox files to single eml files☆23Updated 2 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated 4 months ago
- QtSemanticNotes is a personal knowledge base, personal wiki or just note taking application that features automatic linking, tree view an…☆18Updated 7 years ago
- Server backend and CLI toolkit for WebScrapBook browser extension.☆90Updated this week
- Virtuoz virtual desktop utility☆38Updated 9 years ago
- Juris-M is a variant of the free and friendly Zotero research platform, with support for legal and multilingual materials.☆84Updated 9 months ago
- The open-sourced version of the award-winning Qiqqa research management tool for Windows (a bleeding edge dev fork) ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・…☆47Updated last month
- smoothscan is a tool to convert scanned text into a vectorized output form.☆67Updated 11 years ago
- Batch processing helper – GUI – for “ScanTailor-CLI” -- created by Csaba Kovacs☆15Updated 8 years ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page…☆40Updated 10 months ago
- Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder☆88Updated 5 months ago
- The source for the TagSpaces documentation website☆29Updated 3 weeks ago
- Very simple file search web interface with a locate / mlocate backend☆25Updated 7 months ago
- Batch convert PDF files to text under Windows, using several text extraction methods or OCR☆35Updated 9 years ago
- [ARCHIVED]☆24Updated 6 years ago
- Linux-intelligent-ocr-solution☆144Updated 2 months ago
- Ergonomic line-by-line transcription of scanned text.☆53Updated 4 years ago
- A tiny, hackable, two-way cloud synchronisation client for Linux☆55Updated 5 years ago