denghuichao / proxy-pool

A proxy IP pool service for web crawlers; other crawler programs can fetch proxies from it through a REST API.

☆113

Alternatives and similar repositories for proxy-pool

Users that are interested in proxy-pool are comparing it to the libraries listed below

hemin1003 / java-spider
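A hands-on Java crawler framework built on top of the webmagic framework. It can crawl news content from Tencent, Sohu, Toutiao (integrated as a separate feature), and other sources, works together with elasticsearch for automated crawling, and is already running in production.
☆339 Updated 2 years ago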
dhengyi / ip-proxy-pools-regularly
Scheduled crawling combined with an IP proxy pool.
☆148 Updated 7 years ago
chenerzhu / proxy-pool
A Java proxy IP pool (Proxy Pool) that provides proxy IPs with an availability rate above 95%.
☆395 Updated 6 years ago
zyongjava / spider
A Java crawler system developed with spring boot + webmagic.
☆61 Updated 8 years ago
xpleaf / ispider
A Distributed Crawler System Designed By Java.
☆211 Updated 7 years ago
fengzhizi715 / ProxyPool
A proxy IP pool for use by crawlers.
☆562 Updated 5 years ago
JFanZhao / spider
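Uses java + httpclient + httpcleaner to crawl product information from e-commerce sites with multiple threads in a distributed setup. Data is stored in hbase, solr builds the product index, a redis queue holds a shared URL repository, and zookeeper monitors the lifecycle of crawler nodes, among other things.
☆231 Updated 4 years ago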
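qiyaTech / javaCrawling
The "Qiya crawler" (奇伢爬虫) is built on Spring Boot and WebMagic. It crawls articles from WeChat official accounts, news sites, csdn, info, and other sites. Crawling rules and cleaning rules can be set dynamically, and it can crawl articles from most websites.
☆324 Updated 7 years ago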
out0fmemory / GuozhongCrawler
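GuozhongCrawler is an open-source crawler framework that needs no configuration and is easy to extend. It offers a simple, flexible API, so a crawler can be implemented with only a small amount of code. Its design draws on lessons from multiple crawler frameworks in China and abroad. It uses a fully modular design whose features cover the whole crawler lifecycle (link extraction, page download, content extraction, persistence), and supports multi…
☆96 Updated 10 years ago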
dhengyi / multithreading-crawlers
Multithreaded crawler: scrapes Taobao product detail page URLs.
☆128 Updated 6 years ago
hellokaton / elves
🎊 Design and implementation of a lightweight crawler framework.
☆315 Updated 7 years ago
webmagic-io / jobhunter
An example that uses WebMagic to scrape job postings and persist them to Mysql.
☆224 Updated 8 years ago
superleeyom / code-artisan
A general-purpose Restful API platform that lets users set up projects quickly and focus on business logic and API development.
☆64 Updated 7 years ago
letcheng / ProxyPool
An automatic proxy pool component for dealing with anti-crawler measures.
☆78 Updated 8 years ago
javagaorui5944 / ProxyIpPool
The Crawler Proxy IP Pool Component
☆64 Updated 2 years ago
cbwleft / movie-elasticsearch
An open-source movie search engine built with SpringBoot2.0 + ElasticSearch.
☆87 Updated 2 years ago
xtuhcy / gecco-spring
Using the gecco crawler together with spring.
☆52 Updated 7 years ago
QiuMing / zhihuWebSpider
A Zhihu crawler: a Java web spider based on the webmagic framework.
☆69 Updated 9 years ago
xuxueli / xxl-crawler
A lightweight web crawler framework. (Java crawler framework)
☆730 Updated 7 months ago
codesofun / web-bee
🐝 Web vertical crawler framework for fun
☆189 Updated last year
DMinerJackie / JewelCrawler
Douban movie crawler: a crawler that can crawl movie details and short comments and save them to a mysql database; also includes Sentiment analysis ba…
☆69 Updated 6 years ago
zifangsky / WeatherSpider
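Weather crawler (automatically scrapes and updates weather for cities and towns nationwide on a schedule, and exposes a RESTful query interface), with a proxy IP pool that is refreshed on a schedule and checked for availability.
☆366 Updated 7 years ago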
EzioL / neteasemusic
Uses webmagic to crawl my favorite NetEase Cloud Music playlists and their comments.
☆51 Updated 7 years ago
hexiangtao / wechat4j
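A WeChat client implemented in Java. Supports automatic chat, message listening, auto-reply, adding friends, fetching group member lists, automatic chat logging, and automatic download of image, voice, and video messages.
☆302 Updated 10 months ago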
shenbaise / goodcrawler
Web crawler
☆52 Updated 11 years ago
kanxg / fengchao
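The Fengchao (蜂巢) crawler system only requires defining XPath expressions to crawl websites and apps. It supports multiple parsing methods (XPath, regular expressions), multiple download methods (HttpClient library, PhantomJs, Selenium), and multiple output formats (Excel, MongoDB). It can be published without any modification to Yar…
☆5 Updated 8 years ago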
hxyfj / LagouSpider
A data crawler for the Lagou job site.
☆32 Updated 7 years ago
hemin1003 / aylson-parent
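A back-office management system built with SpringMVC4 + EasyUI, already in production use. Download and import the SQL script and it works out of the box; deployment takes five minutes.
☆148 Updated 2 years ago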
wucao / JCatch
An exception management platform that supports Java, PHP, Python, and other languages.
☆85 Updated 2 years ago
tianshb / MagicToe
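A Java crawler based on webmagic + springboot + mybatis that uses Echarts for data visualization and analysis. It provides a complete solution template covering data acquisition by the crawler, data persistence, data visualization and analysis, and building a simple proxy pool.
☆367 Updated 7 years ago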