tobya / DocTo
Simple command line utility for converting .doc & .xls files to any supported format such as Text, RTF, CSV or PDF
☆470 Updated last month
Alternatives and similar repositories for DocTo:
Users who are interested in DocTo are comparing it to the repositories listed below.
- A command line tool to convert Microsoft Office documents to PDFs ☆636 Updated last year
- Open Source Virtual (Network) Printer for Windows that allows you to create PDFs, OCR text, and print images, with advanced features usua… ☆798 Updated last year
- ☆691 Updated 3 weeks ago
- Microsoft (MS) EMF to SVG conversion library ☆98 Updated 8 months ago
- Deskew is a command line tool for deskewing scanned text documents. It uses Hough transform to detect "text lines" in the image. As an ou… ☆178 Updated last week
- Free open-source OCR application for the Windows Desktop - A modern GUI front-end for the Tesseract OCR engine. The application also incl… ☆258 Updated 10 years ago
- Download Poppler binaries packaged for Windows with dependencies ☆762 Updated 4 months ago
- A free Windows graphical interface to the Tesseract 4.0 OCR engine. ☆58 Updated 3 years ago
- Fast integer versions of trained LSTM models ☆532 Updated 8 months ago
- ☆555 Updated 11 months ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆389 Updated 8 months ago
- RunAsService is a command line tool that allows you to setup a regular console application to run as a service. ☆112 Updated 3 years ago
- Lister plugin for Total Commander, based on ATSynEdit ☆119 Updated 3 months ago
- RUPS is an acronym for Reading and Updating PDF Syntax. RUPS is a tool built on top of iText® that allows you to look inside a PDF docume… ☆307 Updated last week
- WMF to SVG Converting Tool & Library for Java ☆85 Updated last year
- Demos, examples and utilities using PyMuPDF ☆651 Updated 9 months ago
- Source training data for Tesseract for lots of languages ☆854 Updated 3 weeks ago
- ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones … ☆1,247 Updated last year
- A virtual filesystem for various publicly accessible Cloud storage services on the Microsoft Windows platform. ☆313 Updated 7 years ago
- Python script to do PDF OCR conversion using Tesseract ☆374 Updated last year
- ☆421 Updated 10 years ago
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip! ☆285 Updated last year
- Javascript library for creating annotations in PDF documents ☆582 Updated 2 years ago
- QT Box Editor of tesseract-ocr box files ☆174 Updated 6 months ago
- ScanTailor Universal - a fork based on Enhanced+Featured+Master versions of ST ☆210 Updated 3 weeks ago
- Indexer++ official repository ☆68 Updated 5 years ago
- ☆187 Updated 4 years ago
- Windows command-line regex file renamer ☆67 Updated 3 years ago
- A general purpose PDF text-layer redaction tool for Python 2/3. ☆196 Updated 10 months ago
- A post-processing tool for scanned sheets of paper. ☆1,071 Updated 9 months ago