deajan / pmOCR
A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR conversion on file activity
☆67 · Updated 2 years ago
Alternatives and similar repositories for pmOCR
Users that are interested in pmOCR are comparing it to the libraries listed below
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip! ☆301 · Updated 7 months ago
- A free Windows graphical interface to the Tesseract 4.0 OCR engine. ☆61 · Updated 3 years ago
- Convert a PDF via OCR to a TXT file in UTF-8 encoding ☆156 · Updated 2 years ago
- web interface for recoll desktop search ☆292 · Updated 5 years ago
- Building scantailor and its dependencies ☆65 · Updated 2 years ago
- A post-processing tool for scanned sheets of paper. ☆85 · Updated last year
- 📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs ☆70 · Updated last year
- ReadablePDF streamlines the effort of turning a not so great PDF into a more easily readable PDF (or of course a pretty decent PDF into a… ☆33 · Updated 4 years ago
- Tesseract Powered Windows Desktop OCR Application With Multiple Pre/Post Processing GUI ☆42 · Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆406 · Updated last year
- Short script for removing watermarks from PDF files. Requires pdftk. ☆59 · Updated 6 years ago
- BulkPDF is free, easy-to-use open source software that automatically fills an existing PDF form with different values. Onl… ☆134 · Updated last year
- Batch convert PDF files to text under Windows, using several text extraction methods or OCR ☆36 · Updated 10 years ago
- Batch processing helper – GUI – for “ScanTailor-CLI” -- created by Csaba Kovacs ☆16 · Updated 9 years ago
- Textricator is a tool to extract text from documents and generate structured data. ☆351 · Updated 10 months ago
- OCRmyPDF EasyOCR plugin ☆97 · Updated 4 months ago
- A tiny frontend for OCRing PDF files via the web. ☆51 · Updated 5 years ago
- Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder ☆89 · Updated 4 months ago
- Tool to OCR PDFs using Google Cloud Vision ☆42 · Updated 3 years ago
- Export / upload emails from Thunderbird mbox files to single eml files ☆23 · Updated 2 years ago
- A curated list of awesome projects to simplify and improve paper and document scanning. ☆496 · Updated 6 months ago
- A frontend for various backup programs (rsync, rdiff-backup, rclone) that simplifies local and remote backups. ☆24 · Updated 9 months ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆198 · Updated 8 months ago
- Powerful C++ web crawler based on libcurl ☆30 · Updated last year
- TagSpaces Web Clipper for Chrome and Firefox ☆50 · Updated 9 months ago
- smoothscan is a tool to convert scanned text into a vectorized output form. ☆67 · Updated 12 years ago
- Fast PDF generation and compression. Deals with millions of pages daily. ☆132 · Updated 2 weeks ago
- Make your PDF files text-searchable (A GUI for OCRmyPDF) ☆50 · Updated last year
- Family Tree Analyzer - Finds hidden details in your family tree. Install at ☆58 · Updated this week
- WIP tag-based file organizer & search ☆39 · Updated last month