miohtama / pdf-to-htmlLinks
PDF to JPEG images + HTML with <img> alt text converter
☆49Updated 11 years ago
Alternatives and similar repositories for pdf-to-html
Users that are interested in pdf-to-html are comparing it to the libraries listed below
Sorting:
- Create python web applications for Google Glass☆280Updated 11 years ago
- A pair of scripts to download videos and subtitles for the TED Talks (http://www.ted.com)☆42Updated 11 years ago
- A command-line interactive coursera-downloader.☆15Updated 7 years ago
- A simple flask app that runs on heroku and demonstrates HTTP Server-Sent Events (EventSource) protocol.☆106Updated 2 years ago
- Wrapper for pdftohtml that tries to extract paragraph structure☆50Updated 6 years ago
- Sometimes sites make crawling hard. Selenium-crawler uses selenium automation to fix that.☆125Updated 12 years ago
- [not actively maintained] The C++ webkit-server from capybara-webkit with useful extensions and Python bindings☆48Updated 4 years ago
- A simple Python HTTP downloader that support multi-thread downloading and multi-segment file downloading.☆34Updated 8 years ago
- Python module that intent to crack basic captcha engines using OpenCV and Pytesser☆39Updated 11 years ago
- simple inverted index full text search engine written in python☆12Updated 11 years ago
- A crawler, indexer, and query interface all in Python with distributed processing via Pyro4.☆23Updated 13 years ago
- Scrapy project based on dirbot to show how to use Twisted's adbapi to store the scraped data in MySQL.☆117Updated 11 years ago
- Python project scraping imdb and web application implemented using Flask.☆54Updated 10 years ago
- A Translation Tool for Humans☆121Updated 7 years ago