witwall / pdf2htmlEX
Convert PDF to HTML without losing text or format.
☆21 Updated 10 years ago
Alternatives and similar repositories for pdf2htmlEX
Users who are interested in pdf2htmlEX are comparing it to the libraries listed below.
- HTML5 Customizable Reader & Admin Console - Librelio Digital Publishing Suite ☆29 Updated 10 years ago
- PageArchiver (previously called "Scrapbook for SingleFile") is a Chrome extension that helps to archive pages for offline reading ☆90 Updated 12 years ago
- A Python library to detect and extract listing data from HTML pages. ☆108 Updated 8 years ago
- A pair of scripts to download videos and subtitles for the TED Talks (http://www.ted.com) ☆42 Updated 11 years ago
- A simple viewer and inspection tool for text boxes in PDF documents ☆96 Updated 3 years ago
- Auto complete plugin from dictionary with no external dependencies ☆467 Updated 8 years ago
- An online annotation platform for teaching and learning in the humanities. ☆108 Updated 2 weeks ago
- Recipes for calibre ☆69 Updated 12 years ago
- Chrome extension to select and copy table cells. ☆139 Updated 3 years ago
- Lacuna: Digital Annotation for Teaching and Learning ☆37 Updated 7 years ago
- PdfJs-Annotator is a proof of concept project that integrates AnnotatorJs (http://annotatorjs.org/) with the PdfJs (https://mozilla.githu… ☆25 Updated 5 years ago
- A library for extracting tables from PDF files ☆89 Updated 12 years ago
- Javascript library to talk to multiple OLAP backends from multiple frontends ☆17 Updated 13 years ago
- Tesseract documentation ☆75 Updated 4 years ago
- Various Annotorious plugins that add additional image selection tools ☆21 Updated 7 years ago
- Open Video Annotation Project ☆112 Updated 8 years ago
- Suite of tools for detecting changes in web pages and their rendering ☆55 Updated 2 years ago
- HtmlClipper is a bookmarklet which lets you copy html sections of any web pages together with the attached css styles. ☆67 Updated 4 years ago
- A library for extracting tables from PDF files ☆92 Updated 5 years ago
- A dynamic media input form developed for oTranscribe ☆18 Updated 10 years ago
- Python client for Docverter service (pandoc as a service) ☆17 Updated 7 years ago
- A natural language date parser. (Python version of chrono.js) ☆25 Updated 8 months ago
- ☆37 Updated 7 years ago
- Batch convert PDF files to text under Windows, using several text extraction methods or OCR ☆35 Updated 10 years ago
- Linguistic search for large annotated text corpora, based on Apache Lucene ☆119 Updated this week
- 💡✏️️ ⬇️️ JSON to Markdown converter - Generate Markdown from format independent JSON ☆78 Updated 6 years ago
- Artificial Intelligence Knowledge Information Framework ☆55 Updated 2 years ago
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR). ☆26 Updated 11 years ago
- A toolkit for clustering web pages based on various similarity measures. ☆34 Updated 4 years ago
- Use Pandoc and Calibre to compile Markdown text to Epub, with source included in the Epub. ☆48 Updated 3 years ago