scalingexcellence / scrapybookLinks

Scrapy Book Code

☆483

Alternatives and similar repositories for scrapybook

Users that are interested in scrapybook are comparing it to the libraries listed below

Sorting:

scrapy-plugins / scrapy-zyte-smartproxy
Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy
☆366Updated 7 months ago
BruceDone / scrapy_demo
all kinds of scrapy demo
☆164Updated 2 years ago
djm / python-scrapyd-api
A Python wrapper for working with Scrapyd's API.
☆271Updated last year
flisky / scrapy-phantomjs-downloader
PhantomJS Downloader for Scrapy, Yeah!
☆94Updated 11 years ago
aivarsk / scrapy-proxies
Random proxy middleware for Scrapy
☆1,672Updated 6 years ago
scrapy / scrapyd-client
Command line client for Scrapyd server
☆777Updated 2 months ago
alecxe / scrapy-fake-useragent
Random User-Agent middleware based on fake-useragent
☆693Updated 2 years ago
scrapinghub / scrapy-training
Scrapy Training companion code
☆173Updated 6 years ago
scrapy-plugins / scrapy-djangoitem
Scrapy extension to write scraped items using Django models
☆503Updated 2 years ago
yidao620c / core-scrapy
python-scrapy demo
☆810Updated 5 years ago
scalingexcellence / scrapybook-2nd-edition
Scrapy Book 2nd Edition Code http://scrapybook.com/
☆48Updated 3 years ago
LiuXingMing / Scrapy_Redis_Bloomfilter
基于Redis的Bloomfilter去重，并将其扩展到Scrapy框架。
☆347Updated 2 years ago
scrapy-plugins / scrapy-deltafetch
Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls
☆275Updated 8 months ago
scrapy-plugins / scrapy-jsonrpc
Scrapy extension to control spiders using JSON-RPC
☆299Updated 6 years ago
kohn / HttpProxyMiddleware
A middleware for scrapy. Used to change HTTP proxy from time to time.
☆323Updated 7 years ago
scrapinghub / testspiders
Useful test spiders for Scrapy
☆185Updated 5 years ago
scrapy / quotesbot
This is a sample Scrapy project for educational purposes
☆1,343Updated last year
voliveirajr / seleniumcrawler
An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
☆128Updated 6 years ago
cnu / scrapy-random-useragent
Scrapy Middleware to set a random User-Agent for every Request.
☆202Updated 6 years ago
istresearch / scrapy-cluster
This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
☆1,219Updated last year
geekan / scrapy-examples
Multifarious Scrapy examples. Spiders for alexa / amazon / douban / douyu / github / linkedin etc.
☆3,256Updated last year
Python3WebSpider / ScrapyRedisBloomFilter
Scrapy Redis Bloom Filter
☆176Updated 4 years ago
liyaopinner / BloomFilter_imooc
☆69Updated 7 years ago
sebdah / scrapy-mongodb
MongoDB pipeline for Scrapy. This module supports both MongoDB in standalone setups and replica sets. scrapy-mongodb will insert the item…
☆358Updated 4 years ago
Germey / CookiesPool
CookiesPool Based on Redis
☆152Updated 7 years ago
rmax / dirbot-mysql
Scrapy project based on dirbot to show how to use Twisted's adbapi to store the scraped data in MySQL.
☆118Updated 12 years ago
Python3WebSpider / ScrapySplashTest
Scrapy Splash on Taobao Product
☆31Updated 8 years ago
liuslnlp / ProxyPool
跨语言IP代理池，Python实现。
☆354Updated 7 years ago
scrapinghub / scrapyrt
HTTP API for Scrapy spiders
☆871Updated last month
scrapy / dirbot
Scrapy project to scrape public web directories (educational) [DEPRECATED]
☆1,630Updated 8 years ago