witwall / pdf2htmlEXLinks
Convert PDF to HTML without losing text or format.
☆21Updated 10 years ago
Alternatives and similar repositories for pdf2htmlEX
Users that are interested in pdf2htmlEX are comparing it to the libraries listed below
Sorting:
- Cytoscape 3 desktop version.☆17Updated 3 weeks ago
- HTML5 Customizable Reader & Admin Console - Librelio Digital Publishing Suite☆29Updated 10 years ago
- Python library for manipulating Open Packaging Convention (OPC) files like .docx, .pptx, and .xslx☆47Updated 8 years ago
- ☆23Updated 2 years ago
- SQL beautifier for databases include but not limited to Oracle, SQL Server, DB2, Sybase, MySQL, PostgreSQL, Teradata.☆52Updated last year
- A fork of the Arc90 Labs Readability bookmarklet☆82Updated 6 years ago
- Auto complete plugin from dictionary with no external dependencies☆466Updated 8 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/…☆131Updated 2 years ago
- Distributed text analysis suite based on Celery☆96Updated 3 years ago
- An extendable docx file format parser and converter☆194Updated 7 months ago
- HtmlClipper is a bookmarklet which lets you copy html sections of any web pages together with the attached css styles.☆67Updated 4 years ago
- A python library detect and extract listing data from HTML page.☆108Updated 8 years ago
- A dynamic media input form developed for oTranscribe☆18Updated 10 years ago
- A natural language date parser. (Python version of chrono.js)☆25Updated 7 months ago
- Command-line tool for exploring and diagnosing problems with Microsoft Office Open XML files (.docx, .pptx, .xlsx)☆56Updated last year
- Python client for Docverter service (pandoc as a service)☆17Updated 7 years ago
- yael (Yet Another EPUB Library) is a Python library for reading, manipulating, and writing EPUB 2/3 files☆18Updated 10 years ago
- Easily explore, view and edit markdown documentation of a file tree☆67Updated last year
- Chrome extension for XPaths operations done the right way.☆44Updated 6 years ago
- ☆50Updated 3 years ago
- A simple Python HTTP downloader that support multi-thread downloading and multi-segment file downloading.☆34Updated 8 years ago
- Search engine base (crawler, indexer and parser) using Python, Celery, RabbitMQ, CouchDB and Whoosh.☆10Updated 7 months ago
- Logya is a static site generator written in Python designed to be easy to use and flexible.☆18Updated last month
- Notepad++ plugin to run Python scripts☆40Updated 6 years ago
- Artificial Intelligence Knowledge Information Framework☆55Updated 2 years ago
- Docverter Server☆834Updated 9 years ago
- Pipeline for distributed Natural Language Processing, made in Python☆65Updated 8 years ago
- Team collaboration and document management WIKI system☆52Updated 8 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- Linguistic search for large annotated text corpora, based on Apache Lucene☆118Updated this week