witwall / pdf2htmlEX
Convert PDF to HTML without losing text or format.
☆21 Updated 9 years ago
Alternatives and similar repositories for pdf2htmlEX:
Users who are interested in pdf2htmlEX are comparing it to the repositories listed below:
- Free open-source OCR application for the Windows Desktop - A modern GUI front-end for the Tesseract OCR engine. The application also incl… ☆258 Updated 10 years ago
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR). ☆23 Updated 10 years ago
- Python client for Docverter service (pandoc as a service) ☆17 Updated 7 years ago
- PageArchiver (previously called "Scrapbook for SingleFile") is a Chrome extension that helps to archive pages for offline reading ☆87 Updated 11 years ago
- Batch convert PDF files to text under Windows, using several text extraction methods or OCR ☆33 Updated 9 years ago
- Citation Style Language utilities ☆18 Updated 3 years ago
- Java program to add bookmarks to pdf (stable) ☆27 Updated 4 years ago
- SQL beautifier for databases including but not limited to Oracle, SQL Server, DB2, Sybase, MySQL, PostgreSQL, Teradata. ☆51 Updated last year
- Data Store for Annotation Studio ☆46 Updated 2 years ago
- Ergonomic line-by-line transcription of scanned text. ☆51 Updated 4 years ago
- HtmlClipper is a bookmarklet which lets you copy HTML sections of any web page together with the attached CSS styles. ☆67 Updated 3 years ago
- A small Docker image built for the OCRopus OCR system. ☆20 Updated 7 years ago
- Tasks around metadata. ☆21 Updated 3 weeks ago
- An online annotation platform for teaching and learning in the humanities. ☆107 Updated 2 months ago
- The XML treeview for Notepad++ ☆44 Updated 2 years ago
- CSV Buddy helps you make your CSV files ready to be imported by a variety of software. Load/save/export files with various delimiters and… ☆32 Updated 2 years ago
- Analyze PowerBuilder (PowerScript) source code ☆20 Updated 2 years ago
- Part of eMOP: Franken+ tool for creating font training for Tesseract OCR engine from page images. ☆24 Updated 9 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine ☆72 Updated 7 years ago
- Recommendations Serving Engine using Python ☆28 Updated 9 years ago
- Auto complete plugin from dictionary with no external dependencies ☆468 Updated 7 years ago
- Data Explorer is an open-source interactive data-visualization tool. ☆36 Updated 10 years ago
- An expandable and scalable OCR pipeline ☆87 Updated 7 years ago
- An online sentiment analyzer built with Flask and TextBlob ☆15 Updated 11 years ago
- A dynamic media input form developed for oTranscribe ☆18 Updated 9 years ago
- Mouse gesture application for Windows ☆44 Updated last year
- Extract data from an HTML table and store results to a CSV file. ☆38 Updated 9 years ago
- Suite of tools for detecting changes in web pages and their rendering ☆54 Updated last year
- Chrome extension that takes automatic screenshots for a given URL ☆110 Updated 11 years ago
- A tool to batch print PDFs ☆12 Updated 9 months ago