爬虫项目源码整理,使用redis进行url缓存,hbase进行详细信息的存储。使用zookeeper进行爬虫线程的状态监控。
☆18Oct 7, 2015Updated 10 years ago
Alternatives and similar repositories for spider
Users that are interested in spider are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- akka学习理解,使用了maven、sbt两种构建方式,同时使用量java和scala两种语言实现。akka入门,清晰理解akka流程☆13Oct 18, 2015Updated 10 years ago
- image server base on nosql☆10Aug 24, 2016Updated 9 years ago
- 数据平台(DataPlateform),最初的设计想法是:当今大数据横行,我们也不能落后。所以就想着写一个这样的平台系统。此项目集爬虫、搜索、Hadoop、Dwr推送、Quartz 定时任务于一体的平台,其目的是想通过抓取互联网数据,通过大数据推测人或者某一事物的下一行为。C…☆18Jul 31, 2017Updated 8 years ago
- 🚜线程安全、网络编程、消息中间件文章源码🚜☆28Apr 19, 2018Updated 8 years ago
- 基于ffmpeg+spring+quartz+dubbo+zookeeper+MyBatis服务化的视频转换分布式服务☆12Jul 21, 2016Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 关于通过百度地图API采集POI数据,并存储到HBase的项目。☆25Mar 14, 2016Updated 10 years ago
- 利用WebMagic框架进行58同城数据的抓取☆12Oct 13, 2014Updated 11 years ago
- a simple rpc framework for java☆14Dec 9, 2014Updated 11 years ago
- 迁移工具,目标是Oracle,MySQL,SqlServer到PostgreSQL的单项迁移,PostgreSQL和大数据平台Hive,Hbase,Impala等的双向迁移。☆10Dec 3, 2014Updated 11 years ago
- Google 在 2018 年下旬开源了一款新的 Java 工具 Jib,可以轻松地将 Java 应用程序容器化。通过 Jib,我们不需要编写 Dockerfile 或安装 Docker,通过集成到 Maven 或 Gradle 插件,就可以立即将 Java 应用程序容器化…☆21Apr 7, 2019Updated 7 years ago
- j360系列 - 缓存异步写数据库的框架☆15Apr 14, 2016Updated 10 years ago
- rpc_learn Spring + Netty + Protostuff + ZooKeeper 实现了一个轻量级 RPC 框架,使用 Spring 提供依赖注入与参数配置,使用 Netty 实现 NIO 方式的数据传输,使用 Protostuff 实现对象序列化,使用 …☆19May 26, 2015Updated 11 years ago
- 大数据实时计算的基础框架☆48Jan 12, 2015Updated 11 years ago
- word,excel转pdf☆11Oct 26, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Sync是一款分布式场景下基于Redis的安全高效的线程同步组件,提供分布式可重入互斥锁、分布式可重入读写锁、分布式信号量。提供相应注解,使用简单,可与spring-boot无缝集成。☆13Oct 8, 2022Updated 3 years ago
- 参加阿里巴巴中间件比赛时的mom项目源码☆22Nov 12, 2015Updated 10 years ago
- 美食搜索 小程序(mpvue框架)☆29Apr 17, 2019Updated 7 years ago
- 基于阿里Dubbo框架的服务切换工具☆19Jul 21, 2017Updated 8 years ago
- No More than a C build system for clang, gcc and msvc☆13Mar 20, 2025Updated last year
- 一个基于redis消息队列的livePush推流器的分布式服务,修改部分livePush推流实现,采用监听redis消息队列方式控制推流和停止☆11Sep 22, 2016Updated 9 years ago
- mysql数据实时同步到redis,基于mysql binlog实现的同步方案☆10Dec 12, 2015Updated 10 years ago
- 微信短视频后端☆14Oct 11, 2023Updated 2 years ago
- 分布式爬虫框架,基于webdrvier模拟用户请求,kafka消息传递,分布式网页存储使用hbase,task异步任务多线程解析,提供基础服务如:proxy ip服务和号码验证服务等, proxy page使用H5和we版进行接入☆13Dec 18, 2015Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Netty 源码分析,包含各种流程图☆11Jun 23, 2020Updated 5 years ago
- Pinot 是一个实时分布式的 OLAP 数据存储和分析系统。LinkedIn 使用它实现低延迟可伸缩的实时分析。Pinot 从离线数据源(包括 Hadoop 和各类文件)和在线数据源(如 Kafka)中攫取数据进行分析。Pinot 被设计是可以进行水平扩展的☆16Nov 8, 2015Updated 10 years ago
- 开源项目,供学习☆10May 7, 2021Updated 5 years ago
- 66道算法题目+php解题+java解题☆11Sep 12, 2018Updated 7 years ago
- Just a DEMO to demonstrate how to use JNA to type chars into alipay's password edit control automatically.☆12Dec 21, 2017Updated 8 years ago
- ServiceFramework 示例项目☆10Apr 2, 2016Updated 10 years ago
- 爬取百度指数和阿里指数,采用selenium,存入hbase,验证码自动识别,多线程控制☆32Dec 11, 2016Updated 9 years ago
- zookeeper官方提供的分布式锁,选举master,和分布式队列实现☆16Mar 11, 2014Updated 12 years ago
- 小锋生活小助手——JAVA开发的基于爬虫和API实现的查询类微信公众号☆31Jun 7, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- js在线流程图 javascript online flow chart☆24Mar 11, 2019Updated 7 years ago
- 简单状态机实现。同时以简化的订单状态机为例子进行了说明。☆16Oct 13, 2020Updated 5 years ago
- 基于 Spring 和 Jedis 的 Disque 封装,使用注解驱动☆22Feb 15, 2016Updated 10 years ago
- Integration of vertx-web & spring framework☆17Oct 18, 2017Updated 8 years ago
- ☆29Aug 2, 2018Updated 7 years ago
- 数据爬取☆11Sep 1, 2022Updated 3 years ago
- Distributed in-memory cube base on java8 stream☆17Dec 6, 2014Updated 11 years ago