heartfly / ajax_crawlerLinks

A flexible web crawler based on Scrapy for fetching most of Ajax or other various types of web pages. Easy to use: To customize a new web crawler-You just need to write a config file and run.

☆45

Alternatives and similar repositories for ajax_crawler

Users that are interested in ajax_crawler are comparing it to the libraries listed below

Sorting:

wuchong / scrapy-dynamic-configurable
A dynamic configurable news crawler based Scrapy
☆165Updated 8 years ago
shelmesky / crawler
淘宝爬虫原型，基于gevent
☆49Updated 12 years ago
immzz / zhihu-scrapy
A scrapy zhihu crawler
☆77Updated 7 years ago
backto17 / SinaHouseCrawler
基于scrapy,scrapy-redis实现的一个分布式网络爬虫,爬取了新浪房产的楼盘信息及户型图片,实现了常用的爬虫功能需求.
☆40Updated 8 years ago
binhe22 / pullword
A Python package for pullword.com
☆86Updated 5 years ago
hailong0707-zz / spider_news_all
Scrapy Spider for 各种新闻网站
☆110Updated 10 years ago
younghz / scrapy-redis
Redis-based components for scrapy that allows distributed crawling
☆46Updated 11 years ago
mazzzystar / BaiduCrawler
Sample of using proxies to crawl baidu search results.
☆118Updated 7 years ago
ClericPy / EC-Spider
Obsolete 已废弃.
☆86Updated 8 years ago
weizetao / spider-roach
分布式定向抓取集群
☆71Updated 8 years ago
yoyzhou / weibo_scrapy
WEIBO_SCRAPY is a Multi-Threading SINA WEIBO data extraction Framework in Python.
☆155Updated 8 years ago
LiuRoy / github_spider
使用代理调用github API爬去用户数据
☆185Updated 9 years ago
2shou / PhantomjsFetcher
A python web fetcher using phantomjs to mock browser
☆180Updated 8 years ago
tpeng / weibosearch
A distributed Sina Weibo Search spider base on Scrapy and Redis.
☆145Updated 12 years ago
Wooden-Robot / spider-practice
☆20Updated 9 years ago
wanghuafeng / e-business
电商爬虫系统：京东，当当，一号店，国美爬虫（代理使用）；论坛、新闻、豆瓣爬虫
☆104Updated 7 years ago
KDF5000 / RSpider
一个基于scrapy-redis的分布式爬虫模板
☆43Updated 8 years ago
Andrew-liu / scrapy_example
This repository store some example to learn scrapy better
☆177Updated 5 years ago
salamer / Zhihu_Crawler
a crawler for zhihu
☆94Updated 8 years ago
chensoul / scrapy-zhihu-github
scrapy examples for crawling zhihu and github
☆223Updated 2 years ago
LiuXingMing / Tmall1212
天猫双12爬虫，附商品数据。
☆201Updated 8 years ago
LiuRoy / spider_docker
为爬虫引用创建container，包括的模块：scrapy, mongo, celery, rabbitmq
☆37Updated 9 years ago
fxsjy / jiebademo
a demo site for jieba
☆111Updated 12 years ago
yanshengli / sina_weibo_crawler
利用urllib2加beautifulsoup爬取新浪微博
☆70Updated 10 years ago
widy28 / scrapy-taobao
scrapy模拟淘宝登陆
☆74Updated 5 years ago
yoghurtjia / Zhihu_bigdata
使用scrapy和pandas完成对知乎300w用户的数据分析。首先使用scrapy爬取知乎网的300w，用户资料，最后使用pandas对数据进行过滤，找出想要的知乎大牛，并用图表的形式可视化。
☆160Updated 8 years ago
Vespa314 / douban_scrapy
将会陆续添加豆瓣里面各种信息的爬虫代码和分析
☆25Updated 11 years ago
pelick / VerticleSearchEngine
Academic Search Engine using Scrapy, MongoDB, Lucene/Solr, Tika, Struts2, Jquery, Bootstrap, D3, CAS
☆100Updated 12 years ago
PyCN / Ugly-Distributed-Crawler
基于Redis实现的简单到爆的分布式爬虫
☆14Updated 8 years ago
multiangle / Distributed_Microblog_Spider
分布式新浪微博爬虫
☆31Updated 8 years ago