witwall / pdf2htmlEX
Convert PDF to HTML without losing text or format.
☆21Updated 9 years ago
Alternatives and similar repositories for pdf2htmlEX:
Users that are interested in pdf2htmlEX are comparing it to the libraries listed below
- Wandora is a general purpose information extraction, management and publishing application based on Topic Maps and Java.☆131Updated last year
- PageArchiver (previously called "Scrapbook for SingleFile") is a Chrome extension that helps to archive pages for offline reading☆87Updated 11 years ago
- jQuery based XML editor plugin.☆154Updated last week
- jQuery XPath plugin (with full XPath 2.0 language support)☆180Updated 2 years ago
- Ergonomic line-by-line transcription of scanned text.☆50Updated 4 years ago
- Plugin to use rich text in Annotator☆30Updated 10 years ago
- Open Video Annotation Project☆111Updated 7 years ago
- An online annotation platform for teaching and learning in the humanities.☆107Updated 2 months ago
- PdfJs-Annotator is a proof of concept project that integrates AnnotatorJs (http://annotatorjs.org/) with the PdfJs (https://mozilla.githu…☆24Updated 4 years ago
- Semantic data wiki as well as Linked Data publishing engine☆205Updated 7 months ago
- balloon-Consuming Linked Data☆18Updated 8 years ago
- ☆36Updated 9 years ago
- workflowy clone☆11Updated 2 years ago
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR).☆23Updated 10 years ago
- Inline annotation for the web in pure Javascript. Select text, images, or (nearly) anything else, and add your notes.☆9Updated 8 years ago
- Wed is a web-based editor that assists users in editing XML documents according to a schema.☆24Updated 6 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆95Updated 2 years ago
- Easily explore, view and edit markdown documentation of a file tree☆65Updated 7 months ago
- Part of eMOP: Franken+ tool for creating font training for Tesseract OCR engine from page images.☆24Updated 9 years ago
- Web application for management formal representations of knowledge, like controlled vocabularies, taxonomies, thesauri and glossaries☆126Updated last month
- Index and search PDF files using Apache Lucene and PDF Box☆43Updated 4 years ago
- A Lightweight RSS/feed fetcher☆31Updated 3 weeks ago
- HtmlClipper is a bookmarklet which lets you copy html sections of any web pages together with the attached css styles.☆67Updated 3 years ago
- Word-to-DITA transformation framework. Enables generation of DITA maps and topics from styled Microsoft Word documents.☆16Updated 2 years ago
- A natural language date parser. (Python version of chrono.js)☆25Updated 7 months ago
- Execute OpenRefine JSON scripts without OpenRefine (or Java)☆29Updated 2 years ago
- A Desktop Bookmarking App☆42Updated 5 years ago
- Convert an HTML table into an ASCII table: Colspan and Rowspan allowed!☆43Updated 3 years ago