kiorky / spynner
Programmatic web browsing module with AJAX support for Python
☆860Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for spynner
- ☆143Updated 8 years ago
- Useful test spiders for Scrapy☆183Updated 4 years ago
- Scrapy extension to control spiders using JSON-RPC☆296Updated 5 years ago
- MongoDB pipeline for Scrapy. This module supports both MongoDB in standalone setups and replica sets. scrapy-mongodb will insert the item…☆357Updated 3 years ago
- Collection of Scrapy utilities (extensions, middlewares, pipelines, etc)☆31Updated 6 years ago
- Python web scraping framework☆316Updated 7 years ago
- "Scrape Easy" - an extension of the Scrapy framework.☆188Updated 8 years ago
- Webkit based scriptable web browser for python.☆2,764Updated 8 months ago
- [not actively maintained] A lightweight Python library that uses Webkit to enable easy scraping of dynamic, Javascript-heavy web pages☆533Updated 7 years ago
- Run JavaScript code from Python (EOL: https://gist.github.com/doloopwhile/8c6ec7dd4703e8a44e559411cb2ea221)☆708Updated 4 years ago
- Phantompy is a headless WebKit engine with powerful pythonic api build on top of Qt5 Webkit☆613Updated 7 years ago
- Scrapy Middleware to set a random User-Agent for every Request.☆201Updated 5 years ago
- Fast Python Bloom Filter using Mmap☆741Updated 5 years ago
- Python/JavaScript bridge module, making use of Mozilla's spidermonkey JavaScript implementation.☆305Updated 7 years ago
- Mailing for human beings☆590Updated 5 years ago
- PyTime is an easy-use Python module which aims to operate date/time/datetime by string.☆159Updated 2 years ago
- DEPRECATED: Pure Python API for Maxmind's binary GeoIP databases☆482Updated 6 years ago
- Asyncronous HTTP proxy with tunnelling (CONNECT) support☆337Updated last year
- Scrapy project based on dirbot to show how to use Twisted's adbapi to store the scraped data in MySQL.☆117Updated 11 years ago
- A pure-python HTML screen-scraping library☆1,863Updated 2 years ago
- Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls☆267Updated 3 years ago
- Compiled PyV8 for Mac OS X☆103Updated 12 years ago
- Stateful programmatic web browsing in Python, after Andy Lester's Perl module WWW::Mechanize .☆618Updated 7 years ago
- Python library of web-related functions☆392Updated last month
- Fast Redis Bloom Filters in Python☆289Updated 5 years ago
- Web Crawling UI and HTTP API, based on Scrapy and Tornado☆161Updated 2 years ago
- Simple wrapper to drive Google Chrome from Python☆317Updated 3 years ago
- Non-blocking Celery client for Tornado☆567Updated 7 years ago
- DEPRECATED: Replaced by https://github.com/autopilot-rs/autopy☆841Updated 6 years ago
- CSS Selectors for Python☆291Updated last month