miohtama / pdf-to-html
PDF to JPEG images + HTML with <img> alt text converter
☆49Updated 10 years ago
Alternatives and similar repositories for pdf-to-html:
Users that are interested in pdf-to-html are comparing it to the libraries listed below
- cropy : Python content based image crop API and shell command☆18Updated last year
- Sometimes sites make crawling hard. Selenium-crawler uses selenium automation to fix that.☆125Updated 11 years ago
- A pair of scripts to download videos and subtitles for the TED Talks (http://www.ted.com)☆42Updated 11 years ago
- A script that tries to extract a sudoku from an image and solve it.☆77Updated 10 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/…☆131Updated 2 years ago
- Household Intelligent Assistant☆20Updated 10 months ago
- Given an MP3 file we attempt to predict it's genre.☆15Updated 9 years ago
- Spam filtering made easy for you☆142Updated 5 years ago
- A crawler, indexer, and query interface all in Python with distributed processing via Pyro4.☆23Updated 13 years ago
- Create python web applications for Google Glass☆280Updated 11 years ago
- "20 Newsgroups" text classification with python☆151Updated 8 years ago
- Python script to dump profile pictures of friends/random people☆16Updated 8 years ago
- A python script which replies to all birthday wishes on your facebook wall.☆7Updated 6 years ago
- ☆50Updated 3 years ago
- Recommender Systems in Depth: An introduction to Recommender Systems using Python and Crab☆44Updated 11 years ago
- Example how to use dynamic items when working with Scrapy☆9Updated 9 years ago
- Input a text and get it as your wallpaper☆7Updated 8 years ago
- It uses machine learning models (Multinomial NB & SVM) to predict whether the email is spam or ligitimate on two corpus namely Ling-spam …☆89Updated 8 years ago
- PhantomJS compiled on a Raspberry Pi 3, working binary ready to download and run.☆15Updated 8 years ago
- a quick and dirty script to convert a Word (docx) document to html.☆53Updated 3 years ago
- Search engine base (crawler, indexer and parser) using Python, Celery, RabbitMQ, CouchDB and Whoosh.☆11Updated last year
- Scrapy examples crawling Craigslist☆199Updated 9 years ago
- A simple flask app that runs on heroku and demonstrates HTTP Server-Sent Events (EventSource) protocol.☆105Updated last year
- ☆167Updated 6 years ago
- Slides to learn a little natural language processing (NLP) with Python. Written in reST with S5/Docutils.☆28Updated 12 years ago
- ☆17Updated 6 years ago
- A simple crawler in python☆25Updated 12 years ago
- sentiment analysis for twitter☆51Updated 13 years ago
- End to end OCR system for Telugu. Based on Convolutional Neural Networks.☆50Updated 3 years ago
- Hacker News REST API using Flask on Heroku using memcached.☆91Updated 10 years ago