爬虫项目源码整理,使用redis进行url缓存,hbase进行详细信息的存储。使用zookeeper进行爬虫线程的状态监控。
☆19Oct 7, 2015Updated 10 years ago
Alternatives and similar repositories for spider
Users that are interested in spider are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- akka学习理解,使用了maven、sbt两种构建方式,同时使用量java和scala两种语言实现。akka入门,清晰理解akka流程☆13Oct 18, 2015Updated 10 years ago
- java分布式爬虫,主机和从机控制的机制☆14May 21, 2015Updated 10 years ago
- image server base on nosql☆10Aug 24, 2016Updated 9 years ago
- 数据平台(DataPlateform),最初的设计想法是:当今大数据横行,我们也不能落后。所以就想着写一个这样的平台系统。此项目集爬虫、搜索、Hadoop、Dwr推送、Quartz定时任务于一体的平台,其目的是想通过抓取互联网数据,通过大数据推测人或者某一事物的下一行为。C…☆18Jul 31, 2017Updated 8 years ago
- 基于ffmpeg+spring+quartz+dubbo+zookeeper+MyBatis服务化的视频转换分布式服务☆12Jul 21, 2016Updated 9 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 关于通过百度地图API采集POI数据,并存储到HBase的项目。☆25Mar 14, 2016Updated 10 years ago
- 利用WebMagic框架进行58同城数据的抓取☆12Oct 13, 2014Updated 11 years ago
- 阿里巴巴大数据竞赛☆63Jun 2, 2014Updated 11 years ago
- 迁移工具,目标是Oracle,MySQL,SqlServer到PostgreSQL的单项迁移,PostgreSQL和大数据平台Hive,Hbase,Impala等的双向迁移。☆10Dec 3, 2014Updated 11 years ago
- 基于Mole的一个企业级web应用的架子☆24Jan 30, 2015Updated 11 years ago
- Google 在 2018 年下旬开源了一款新的 Java 工具 Jib,可以轻松地将 Java 应用程序容器化。通过 Jib,我们不需要编写 Dockerfile 或安装 Docker,通过集成到 Maven 或 Gradle 插件,就可以立即将 Java 应用程序容器化…☆21Apr 7, 2019Updated 6 years ago
- j360系列 - 缓存异步写数据库的框架☆15Apr 14, 2016Updated 9 years ago
- 集成第三方登录功能(新浪微博,腾讯QQ,微信,公众号登录)☆24Mar 23, 2017Updated 9 years ago
- 大数据实时计算的基础框架☆49Jan 12, 2015Updated 11 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- rpc_learn Spring + Netty + Protostuff + ZooKeeper 实现了一个轻量级 RPC 框架,使用 Spring 提供依赖注入与参数配置,使用 Netty 实现 NIO 方式的数据传输,使用 Protostuff 实现对象序列化,使用 …☆19May 26, 2015Updated 10 years ago
- ☆11May 9, 2018Updated 7 years ago
- word,excel转pdf☆11Oct 26, 2018Updated 7 years ago
- 基于CAS单点登录服务端进行二次开发的SpringBoot版轻量级CAS-Server☆10Aug 4, 2022Updated 3 years ago
- Github flavored gatsby blog with github flavored markdown☆10Jul 19, 2023Updated 2 years ago
- Seckill system based on SpringBoot, MyBatis, Redis and so on.☆13May 16, 2017Updated 8 years ago
- 实时数据分析平台☆41Jun 26, 2013Updated 12 years ago
- Sync是一款分布式场景下基于Redis的安全高效的线程同步组件,提供分布式可重入互斥锁、分布式可重入读写锁、分布式信号量。提供相应注解,使用简单,可与spring-boot无缝集成。☆13Oct 8, 2022Updated 3 years ago
- 参加阿里巴巴中间件比赛时的mom项目源码☆22Nov 12, 2015Updated 10 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- JTS Topology Suite 1.14 with additional functions for GeoSpark☆14Jan 5, 2018Updated 8 years ago
- 基于阿里Dubbo框架的服务切换工具☆19Jul 21, 2017Updated 8 years ago
- No More than a C build system for clang, gcc and msvc☆13Mar 20, 2025Updated last year
- douyin api,抖音上传接口,抖音接口,抖音搬家,视频备份☆19Aug 9, 2020Updated 5 years ago
- 云笔记项目,大数据项目,Hbase+Redis+Hadoop+Zookeeper☆13May 2, 2018Updated 7 years ago
- 百度爬虫:热词,词频,音乐,poi信息☆21Mar 10, 2015Updated 11 years ago
- 个性化推荐算法的通用处理框架,基于Mahout和Lucene☆18May 25, 2015Updated 10 years ago
- mysql数据实时同步到redis,基于mysql binlog实现的同步方案☆10Dec 12, 2015Updated 10 years ago
- 微信短视频后端☆14Oct 11, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 分布式爬虫框架,基于webdrvier模拟用户请求,kafka消息传递,分布式网页存储使用hbase,task异步任务多线程解析,提供基础服务如:proxy ip服务和号码验证服务等, proxy page使用H5和we版进行接入☆13Dec 18, 2015Updated 10 years ago
- Netty 源码分析,包含各种流程图☆11Jun 23, 2020Updated 5 years ago
- Pinot 是一个实时分布式的 OLAP 数据存储和分析系统。LinkedIn 使用它实现低延迟可伸缩的实时分析。Pinot 从离线数据源(包括 Hadoop 和各类文件)和在线数据源(如 Kafka)中攫取数据进行分析。Pinot 被设计是可以进行水平扩 展的☆16Nov 8, 2015Updated 10 years ago
- Excavator(挖掘机)是一个分布式的Java RMI框架。(求项目使用,有兴趣的可以电邮oldmanpushcart@gmail.com)☆52Jun 29, 2022Updated 3 years ago
- 开源项目,供学习☆10May 7, 2021Updated 4 years ago
- 66道算法题目+php解题+java解题☆11Sep 12, 2018Updated 7 years ago
- Just a DEMO to demonstrate how to use JNA to type chars into alipay's password edit control automatically.☆12Dec 21, 2017Updated 8 years ago