ddviplinux / crawler-framework

分布式爬虫框架,基于webdrvier模拟用户请求,kafka消息传递,分布式网页存储使用hbase,task异步任务多线程解析,提供基础服务如:proxy ip服务和号码验证服务等, proxy page使用H5和we版进行接入
13Updated 9 years ago

Alternatives and similar repositories for crawler-framework:

Users that are interested in crawler-framework are comparing it to the libraries listed below