vinta / haul
An Extensible Image Crawler
☆159Updated 8 years ago
Alternatives and similar repositories for haul:
Users that are interested in haul are comparing it to the libraries listed below
- Bringing sanity to world of messed-up data☆66Updated 10 years ago
- Web Content Retrieval for Humans™☆618Updated 2 years ago
- PyQuery-based scraping micro-framework.☆116Updated 3 years ago
- "Scrape Easy" - an extension of the Scrapy framework.☆188Updated 8 years ago
- Image histogram remapping☆213Updated 5 years ago
- PyTime is an easy-use Python module which aims to operate date/time/datetime by string.☆158Updated 2 years ago
- A basket of python snippets☆221Updated 9 years ago
- Friendly Python Dates☆189Updated 5 years ago
- Python web scraping framework☆313Updated 7 years ago
- elegant email sending for Python☆195Updated 4 years ago
- A flask API for running your scrapy spiders☆128Updated 6 years ago
- Tiny python web crawler☆169Updated 8 years ago
- A Python 3 library for parsing human-written times and dates☆345Updated 5 years ago
- Tornado Web Crawler☆66Updated 12 years ago
- a small library for extracting rich content from urls☆645Updated 3 months ago
- A simple, immutable URL class with a clean API for interrogation and manipulation.☆292Updated last year
- Fill HTML login forms automatically☆273Updated 11 months ago
- Python library of web-related functions☆400Updated last month
- Readable, simple and fast asynchronous non-blocking network apps☆118Updated last week
- A project to attempt to automatically login to a website given a single seed☆123Updated 2 years ago
- Web Crawling UI and HTTP API, based on Scrapy and Tornado☆162Updated 2 years ago
- A BitTorrent client written in Python. Supports multi-file torrents and implements a rarest-first piece download strategy.☆30Updated 12 years ago
- Phantompy is a headless WebKit engine with powerful pythonic api build on top of Qt5 Webkit☆612Updated 7 years ago
- Collection of Scrapy utilities (extensions, middlewares, pipelines, etc)☆32Updated 7 years ago
- Python powered spreadsheets☆173Updated 6 years ago
- Never see escaped bytes in output.☆158Updated 2 years ago
- Python wrapper for the tesseract OCR engine. The module is based on OpenCV☆178Updated 7 years ago
- A Google Charts API for Python, meant to be used as an alternative to matplotlib.☆205Updated 7 years ago
- Python module to allow for easy creation of a google maps HTML file.☆113Updated 6 years ago
- download, install, and configure Python in one line☆212Updated 8 years ago