miohtama / pdf-to-html
PDF to JPEG images + HTML with <img> alt text converter
☆49Updated 10 years ago
Alternatives and similar repositories for pdf-to-html:
Users that are interested in pdf-to-html are comparing it to the libraries listed below
- A simple Flask app that runs on Heroku and demonstrates the HTTP Server-Sent Events (EventSource) protocol.☆105Updated last year
- Sometimes sites make crawling hard. Selenium-crawler uses selenium automation to fix that.☆125Updated 11 years ago
- Extract meaningful content from PDF and PSD files, such as text and images, both linked into a common JSON string☆37Updated 7 years ago
- ☆79Updated 9 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/…☆130Updated last year
- An example of verbal communication using ChatterBot☆107Updated 5 years ago
- a quick and dirty script to convert a Word (docx) document to html.☆53Updated 3 years ago
- ☆128Updated 8 years ago
- Cocktail party problem solution using deep learning☆16Updated 7 years ago
- Hacker News REST API using Flask on Heroku using memcached.☆91Updated 10 years ago
- An Instagram Scraper☆85Updated 7 years ago
- Python module that intends to crack basic captcha engines using OpenCV and Pytesser☆39Updated 10 years ago
- Solve a maze simply by pointing a camera at it☆34Updated 8 years ago
- A search web app built by Flask and Google CSE☆183Updated 2 years ago
- A ChatterBot logic adapter that returns information about the current weather☆29Updated 5 years ago
- ☆39Updated 4 years ago
- playing pacman with gestures☆31Updated 7 years ago
- Downloading any number of images for a search query☆78Updated 6 years ago
- Tool to extract news articles from newspapers and give context about the news☆210Updated 7 years ago
- Nudity detection in Python☆26Updated 11 years ago
- Personal development repository to prepare contributions and patches for Apache Mahout☆16Updated 14 years ago
- API - a simple web search engine☆24Updated 7 years ago
- Use the Google Custom Search API from Python.☆35Updated 10 years ago
- PhantomJS compiled on a Raspberry Pi 3, working binary ready to download and run.☆15Updated 8 years ago
- Python: An all-in-one Web Crawler, Web Parser and Web Scraping library!☆114Updated 11 months ago
- A Python wrapper for the Baidu Yuyin API☆10Updated 8 years ago
- This is the code for the "Build a Web Scraper" live stream by @Sirajology on YouTube☆54Updated 6 years ago
- Scrapes the summary card (百科名片) and list-section information from Baidu Baike☆9Updated 11 years ago
- A pair of scripts to download videos and subtitles for the TED Talks (http://www.ted.com)☆42Updated 10 years ago
- Implementing Search Engine and Page Rank Algorithm Using Python☆18Updated 8 years ago