WebHunger is an extensible, full-scale crawler framework that supports distributed crawling. It aims to let users focus on web page parsing without worrying about the crawling process.
☆18 · Apr 11, 2018 · Updated 8 years ago
Alternatives and similar repositories for webhunger
Users that are interested in webhunger are comparing it to the libraries listed below.
- Example server for https://github.com/NaikSoftware/StompProtocolAndroid/tree/master/example-client ☆11 · Feb 28, 2017 · Updated 9 years ago
- spider of doubanbook ☆10 · Jun 21, 2017 · Updated 8 years ago
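- A simple rule matcher implemented with an inverted index and binary search ☆18 · Mar 14, 2019 · Updated 7 years ago
- Sync is a safe, efficient, Redis-based thread synchronization component for distributed scenarios. It provides a distributed reentrant mutex, a distributed reentrant read-write lock, and a distributed semaphore, along with corresponding annotations. It is simple to use and integrates seamlessly with spring-boot. ☆13 · Oct 8, 2022 · Updated 3 years ago
- Distributed crawler framework that simulates user requests with WebDriver, passes messages through kafka, stores web pages in hbase, and parses them with multithreaded asynchronous tasks. It provides basic services such as a proxy IP service and a number verification service; the proxy page is accessed through H5 and web versions. ☆13 · Dec 18, 2015 · Updated 10 years ago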
- ☆21 · Sep 3, 2018 · Updated 7 years ago
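- A simple state machine implementation, illustrated with a simplified order state machine as an example. ☆15 · Oct 13, 2020 · Updated 5 years ago
- Aggregated tech news reading in three minutes a day ☆18 · May 15, 2018 · Updated 7 years ago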
- springBoot-swagger-mybatis-shardbatis ☆22 · Jan 13, 2017 · Updated 9 years ago
- Crawler framework that wraps tools such as HttpClient, Htmlunit, and Selenium ☆27 · Nov 15, 2018 · Updated 7 years ago
- Demo of implementing, detecting, and repairing a JavaAgent memory shell ☆11 · Dec 7, 2022 · Updated 3 years ago
- black Ip lists, dorks-collection ☆17 · Apr 1, 2026 · Updated 2 weeks ago
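- Data platform (DataPlateform). The original design idea: big data is everywhere today, and we cannot fall behind, so the author set out to write such a platform system. This project combines a crawler, search, Hadoop, Dwr push, and Quartz scheduled tasks in one platform. Its goal is to crawl internet data and use big data to predict the next behavior of a person or thing. C…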