deajan / pmOCRLinks
A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR conversion on file activity
☆65Updated last year
Alternatives and similar repositories for pmOCR
Users that are interested in pmOCR are comparing it to the libraries listed below
Sorting:
- Ergonomic line-by-line transcription of scanned text.☆53Updated 4 years ago
- Frontend part i.e. web-based user interface of Papermerge Document Management System☆37Updated 2 years ago
- Textricator is a tool to extract text from documents and generate structured data.☆347Updated 4 months ago
- A free Windows graphical interface to the Tesseract 4.0 OCR engine.☆58Updated 3 years ago
- A post-processing tool for scanned sheets of paper.☆82Updated last year
- QtSemanticNotes is a personal knowledge base, personal wiki or just note taking application that features automatic linking, tree view an…☆18Updated 7 years ago
- Tesseract Powered Windows Desktop OCR Application With Multiple Pre/Post Processing GUI☆42Updated last year
- ReadablePDF streamlines the effort of turning a not so great PDF into a more easily readable PDF (or of course a pretty decent PDF into a…☆33Updated 3 years ago
- Batch convert PDF files to text under Windows, using several text extraction methods or OCR☆34Updated 9 years ago
- 📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs☆70Updated last year
- Tool to index and serve HTML files. Powered by Datasette.☆103Updated 3 years ago
- smoothscan is a tool to convert scanned text into a vectorized output form.☆67Updated 11 years ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆117Updated 3 weeks ago
- Batch processing helper – GUI – for “ScanTailor-CLI” -- created by Csaba Kovacs☆15Updated 8 years ago
- Automatic de-keystoning for single camera DIY book scanners.☆49Updated 4 years ago
- Server backend and CLI toolkit for WebScrapBook browser extension.☆90Updated last week
- PDF to XML ALTO file converter☆247Updated 2 weeks ago
- compare two PDF files, write a resulting PDF with highlighted changes☆56Updated 11 months ago
- BulkPDF is a free and easy to use open source software, which allows to automatically fill an existing PDF form with differen values. Onl…☆133Updated 9 months ago
- Various Python tools for Joplin (Hotfolder, PDF Previews, ToDo List) and AutoIt (JoplinWinBackup) for Backups under Windows.☆46Updated 4 years ago
- Juris-M is a variant of the free and friendly Zotero research platform, with support for legal and multilingual materials.☆83Updated 9 months ago
- Tool to OCR PDFs using Google Cloud Vision☆42Updated 2 years ago
- A chrome extension for automatically save the visited pages and the downloaded URLs in your bookmarks.☆16Updated 9 years ago
- Efficient hOCR tooling☆47Updated 3 weeks ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆191Updated 2 months ago
- List of tools for dealing with the wonderful PDF format.☆51Updated 4 years ago
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.☆18Updated last year
- Generates bookmarks from the table of contents already available at the beginning of pdf files.☆37Updated 3 weeks ago
- Open Source PDF Document Management☆136Updated this week
- Extensible outliner and personal time organizer to manage todo lists, schedule tasks, remind events.☆47Updated 7 years ago