爬虫项目源码整理,使用redis进行url缓存,hbase进行详细信息的存储。使用zookeeper进行爬虫线程的状态监控。
☆19Oct 7, 2015Updated 10 years ago
Alternatives and similar repositories for spider
Users that are interested in spider are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- akka学习理解,使用了maven、sbt两种构建方式,同时使用量java和scala两种语言实现。akka入门,清晰理解akka流程☆13Oct 18, 2015Updated 10 years ago
- java分布式爬虫,主机和从机控制的机制☆14May 21, 2015Updated 11 years ago
- 数据平台(DataPlateform),最初的设计想法是:当今大数据横行,我们也不能落后。所以就想着写一个这样的平台系统。此项目 集爬虫、搜索、Hadoop、Dwr推送、Quartz定时任务于一体的平台,其目的是想通过抓取互联网数据,通过大数据推测人或者某一事物的下一行为。C…☆18Jul 31, 2017Updated 8 years ago
- 关于通过百度地图API采集POI数据,并存储到HBase的项目。☆25Mar 14, 2016Updated 10 years ago
- a simple rpc framework for java☆14Dec 9, 2014Updated 11 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 阿里巴巴大数据竞赛☆63Jun 2, 2014Updated 11 years ago
- 迁移工具,目标是Oracle,MySQL,SqlServer到PostgreSQL的单项迁移,PostgreSQL和大数据平台Hive,Hbase,Impala等的双向迁移。☆10Dec 3, 2014Updated 11 years ago
- 基于Mole的一个企业级web应用的架子☆24Jan 30, 2015Updated 11 years ago
- Google 在 2018 年下旬开源了一款新的 Java 工具 Jib,可以轻松地将 Java 应用程序容器化。通过 Jib,我们不需要编写 Dockerfile 或安装 Docker,通过集成到 Maven 或 Gradle 插件,就可以立即将 Java 应用程序容器化…☆21Apr 7, 2019Updated 7 years ago
- j360系列 - 缓存异步写数据库的框架☆15Apr 14, 2016Updated 10 years ago
- 集成第三方登录功能(新浪微博,腾讯QQ,微信,公众号登录)☆24Mar 23, 2017Updated 9 years ago
- word,excel转pdf☆11Oct 26, 2018Updated 7 years ago
- Sync是一款分布式场景下基于Redis的安全高效的线程同步组件,提供分布式可重入互斥锁、分布式可重入读写锁、分布式信号量。提供相应注解,使用简单,可与spring-boot无缝集成。☆13Oct 8, 2022Updated 3 years ago
- JTS Topology Suite 1.14 with additional functions for GeoSpark☆14Jan 5, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 基于阿里Dubbo框架的服务切换工具☆19Jul 21, 2017Updated 8 years ago
- No More than a C build system for clang, gcc and msvc☆13Mar 20, 2025Updated last year
- douyin api,抖音上传接口,抖音接口,抖音搬家,视频备份☆19Aug 9, 2020Updated 5 years ago
- 云笔记项目,大数据项目,Hbase+Redis+Hadoop+Zookeeper☆13May 2, 2018Updated 8 years ago
- 一个基于redis消息队列的livePush推流器的分布式服务,修改部分livePush推流实现,采用监听redis消息队列方式控制推流和停止☆11Sep 22, 2016Updated 9 years ago
- 个性化推荐算法的通用处理框架,基于Mahout和Lucene☆18May 25, 2015Updated 11 years ago
- mysql数据实时同步到redis,基于mysql binlog实现的同步方案☆10Dec 12, 2015Updated 10 years ago
- 微信短视频后端☆14Oct 11, 2023Updated 2 years ago
- 分布式爬虫框架,基于webdrvier模拟用户请求,kafka消息传递,分布式网页存储使用hbase,task异步任务多线程解析,提供基础服务如:proxy ip服务和号码验证服务等, proxy page使用H5和we版进行接入☆13Dec 18, 2015Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Netty 源码分析,包含各种流程图☆11Jun 23, 2020Updated 5 years ago
- Pinot 是一个实时分布式的 OLAP 数据存储和分析系统。LinkedIn 使用它实现低延迟可伸缩的实时分析。Pinot 从离线数据源(包括 Hadoop 和各类文件)和在线数据源(如 Kafka)中攫取数据进行分析。Pinot 被设计是可以进行水平扩展的☆16Nov 8, 2015Updated 10 years ago
- Excavator(挖掘机)是一个分布式的Java RMI框架。(求项目使用,有兴趣的可以电邮oldmanpushcart@gmail.com)☆51Jun 29, 2022Updated 3 years ago
- 开源项目,供学习☆10May 7, 2021Updated 5 years ago
- A python wrap for Baidu Yuyin API☆10Aug 3, 2016Updated 9 years ago
- zookeeper官方提供的分布式锁,选举master,和分布式队列实现☆16Mar 11, 2014Updated 12 years ago
- 小锋生活小助手——JAVA开发的基于爬虫和API实现的查询类微信公众号☆31Jun 7, 2018Updated 7 years ago
- js在线流程图 javascript online flow chart☆24Mar 11, 2019Updated 7 years ago
- 简单状态机实现。同时以简化的订单状态机为例子进行了说明。☆16Oct 13, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 基于 Spring 和 Jedis 的 Disque 封装,使用注解驱动☆22Feb 15, 2016Updated 10 years ago
- Integration of vertx-web & spring framework☆17Oct 18, 2017Updated 8 years ago
- 区块链+计步运动项目,主要采用以太坊、智能合约、springboot以及小程序等技术☆56Jul 26, 2018Updated 7 years ago
- Distributed in-memory cube base on java8 stream☆17Dec 6, 2014Updated 11 years ago
- Vert.x-Web 3.2.1 same as spring framework web, not dependent spring ,annotation develop. Vertx-RPC remote call, annotation .☆21Feb 10, 2023Updated 3 years ago
- 一个支持多级缓存的分布式缓存系统☆21Dec 27, 2017Updated 8 years ago
- a simple distributed spider in Java. Java编写的一个简单分布式爬虫☆160Jun 18, 2013Updated 12 years ago