scantailor / ScanTailor-CLI-GUI
Batch processing helper – GUI – for “ScanTailor-CLI” -- created by Csaba Kovacs
☆16 Updated 9 years ago
Alternatives and similar repositories for ScanTailor-CLI-GUI
Users who are interested in ScanTailor-CLI-GUI are comparing it to the repositories listed below
- Building scantailor and its dependencies ☆65 Updated 2 years ago
- Fast PDF generation and compression. Deals with millions of pages daily. ☆133 Updated 2 weeks ago
- A free Windows graphical interface to the Tesseract 4.0 OCR engine. ☆61 Updated 3 years ago
- Automatic de-keystoning for single camera DIY book scanners ☆25 Updated 9 years ago
- PDF to DjVu converter ☆99 Updated 2 years ago
- CDXJ Indexing of WARC/ARCs ☆31 Updated last year
- Conversions between various OCR formats ☆82 Updated 2 years ago
- Automatic de-keystoning for single camera DIY book scanners. ☆50 Updated 5 years ago
- ScanTailor Universal - a fork based on Enhanced+Featured+Master versions of ST ☆236 Updated 2 weeks ago
- Documentation and use cases for ALTO XML ☆41 Updated 7 years ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac… ☆57 Updated 4 months ago
- Search interface for scholarly works ☆85 Updated last year
- The CIS OCR PostCorrectionTool ☆44 Updated 3 years ago
- Linux-intelligent-ocr-solution ☆149 Updated 7 months ago
- A dockerized, queued high fidelity web archiver based on Squidwarc ☆60 Updated last year
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives ☆16 Updated 4 years ago
- A list of things related to software, literature, and other content for 🕣 Memento ☆104 Updated 2 weeks ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆198 Updated 8 months ago
- The hOCR Embedded OCR Workflow and Output Format ☆75 Updated last year
- Efficient hOCR tooling ☆55 Updated 5 months ago
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR … ☆67 Updated 2 years ago
- Raspberry Pi image for controlling a DIYBookScanner via spreads ☆37 Updated 10 years ago
- Batch convert PDF files to text under Windows, using several text extraction methods or OCR ☆35 Updated 10 years ago
- Convert ALTO XML to plain text + minimal metadata ☆17 Updated last year
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des… ☆159 Updated 10 months ago
- A collection of tools for archiving and analysing the internet. ☆76 Updated 3 years ago
- Tools to process books in a cloud based pipeline system ☆65 Updated last month
- Simple, unobtrusive time tracking utility for Windows ☆18 Updated 5 years ago
- Specifications developed and maintained by the Webrecorder community. ☆140 Updated 3 months ago
- WordNet-LMF formats ☆24 Updated 2 months ago