爬虫项目源码整理,使用redis进行url缓存,hbase进行详细信息的存储。使用zookeeper进行爬虫线程的状态监控。
☆19Oct 7, 2015Updated 10 years ago
Alternatives and similar repositories for spider
Users that are interested in spider are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- akka学习理解,使用了maven、sbt两种构建方式,同时使用量java和scala两种语言实现。akka入门,清晰理解akka流程☆13Oct 18, 2015Updated 10 years ago
- 基于openoffice+jodconverter-core-3.0-beta-4-dist+spring+quartz+dubbo+zookeeper+MyBatis服务化的文档转换服务☆13Jul 21, 2016Updated 9 years ago
- 数据平台(DataPlateform),最初的设计想法是:当今大数据横行,我们也不能落后。所以就想着写一个这样的平台系统。此项目集爬虫、搜索、Hadoop、Dwr推送、Quartz定时任务于一体的平台,其目的是想通过抓取互联网数据,通过大数据推测人或者某一事物的下一行为。C…☆18Jul 31, 2017Updated 8 years ago
- 关于通过百度地图API采集POI数据,并存储到HBase的项目。☆25Mar 14, 2016Updated 10 years ago
- 利用WebMagic框架进行58同城数据的抓取☆12Oct 13, 2014Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- a simple rpc framework for java☆14Dec 9, 2014Updated 11 years ago
- 阿里巴巴大数据竞赛☆63Jun 2, 2014Updated 11 years ago
- 迁移工具,目标是Oracle,MySQL,SqlServer到PostgreSQL的单项迁移,PostgreSQL和大数据平台Hive,Hbase,Impala等的双向迁移。☆10Dec 3, 2014Updated 11 years ago
- ExtendedMacro - BurpSuite plugin providing extended macro functionality☆15Jan 13, 2021Updated 5 years ago
- 根据关键字扫描github代码泄露☆11Oct 31, 2018Updated 7 years ago
- 基于Mole的一个企业级web应用的架子☆24Jan 30, 2015Updated 11 years ago
- Apache Flink☆12Updated this week
- Google 在 2018 年下旬开源了一款新的 Java 工具 Jib,可以轻松地将 Java 应用程序容器化。通过 Jib,我们不需要编写 Dockerfile 或安装 Docker,通过集成到 Maven 或 Gradle 插件,就可以立即将 Java 应用程序容器化…☆21Apr 7, 2019Updated 7 years ago
- j360系列 - 缓存异步写数据库的框架☆15Apr 14, 2016Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Strutsy - Mass exploitation of Apache Struts (CVE-2017-5638) vulnerability☆10Sep 1, 2018Updated 7 years ago
- 大数据实时计算的基础框架☆49Jan 12, 2015Updated 11 years ago
- rpc_learn Spring + Netty + Protostuff + ZooKeeper 实现了一个轻量级 RPC 框架,使用 Spring 提供依赖注入与参数配置,使用 Netty 实现 NIO 方式的数据传输,使用 Protostuff 实现对象序列化,使用 …☆19May 26, 2015Updated 10 years ago
- word,excel转pdf☆11Oct 26, 2018Updated 7 years ago
- 基于CAS单点登录服务端进行二次开发的SpringBoot版轻量级CAS-Server☆10Aug 4, 2022Updated 3 years ago
- 实时数据分析平台☆41Jun 26, 2013Updated 12 years ago
- 参加阿里巴巴中间件比赛时的mom项目源码☆22Nov 12, 2015Updated 10 years ago
- JTS Topology Suite 1.14 with additional functions for GeoSpark☆14Jan 5, 2018Updated 8 years ago
- 基于阿里Dubbo框架的服务切换工具☆19Jul 21, 2017Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- No More than a C build system for clang, gcc and msvc☆13Mar 20, 2025Updated last year
- WAF测试工 具---用例测试(增加result输出)☆13Mar 6, 2015Updated 11 years ago
- douyin api,抖音上传接口,抖音接口,抖音搬家,视频备份☆19Aug 9, 2020Updated 5 years ago
- 云笔记项目,大数据项目,Hbase+Redis+Hadoop+Zookeeper☆13May 2, 2018Updated 7 years ago
- 百度爬虫:热词,词频,音乐,poi信息☆21Mar 10, 2015Updated 11 years ago
- 一个基于redis消息队列的livePush推流器的分布式服务,修改部分livePush推流实现,采用监听redis消息队列方式控制推流和停止☆11Sep 22, 2016Updated 9 years ago
- 个性化推荐算法的通用处理框架,基于Mahout和Lucene☆18May 25, 2015Updated 10 years ago
- Provable Things documentation☆21Aug 28, 2019Updated 6 years ago
- 微信短视频后端☆14Oct 11, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 分布式爬虫框架,基于webdrvier模拟用户请求,kafka消息传递,分布式网页存储使用hbase,task异步任务多线程解析,提供基础服务如:proxy ip服务和号码验证服务等, proxy page使用H5和we版进行接入☆13Dec 18, 2015Updated 10 years ago
- Netty 源码分析,包含各种流程图☆11Jun 23, 2020Updated 5 years ago
- Pinot 是一个实时分布式的 OLAP 数据存储和分析系统。LinkedIn 使用它实现低延迟可伸缩的实时分析。Pinot 从离线数据源(包括 Hadoop 和各类文件)和在线数据源(如 Kafka)中攫取数据进行分析。Pinot 被设计是可以进行水平扩展的☆16Nov 8, 2015Updated 10 years ago
- 66道算法题目+php解题+java解题☆11Sep 12, 2018Updated 7 years ago
- Just a DEMO to demonstrate how to use JNA to type chars into alipay's password edit control automatically.☆12Dec 21, 2017Updated 8 years ago
- ServiceFramework 示例项目☆10Apr 2, 2016Updated 10 years ago
- 一个简单的NIDS系统☆15May 17, 2017Updated 8 years ago