xpleaf / data-extract-clean-analysis
The project of data cleaning and data analysis based on MapReduce.
☆62Updated 6 years ago
Alternatives and similar repositories for data-extract-clean-analysis:
Users that are interested in data-extract-clean-analysis are comparing it to the libraries listed below
- 大数据平台相关代码(ES/Hive/Hadoop/hdfs/hbase)☆74Updated 2 years ago
- kafka spark hbase 日志统计☆79Updated 8 years ago
- spark全示例代码(java、scala) Spark most full instance code DEMO (java、scala)☆83Updated 4 years ago
- 本项目记录我学习hadoop和spark等开源框架的代码,因为也是最近才用github,之前都是荒废状态,故部分都是是之前写好的,现在上传至github☆85Updated 7 years ago
- Spark、Hadoop、Flink、Storm、Kafka编程实例学习☆168Updated 7 years ago
- Spark Streaming监控平台,支持任务部署与告警、自启动☆128Updated 6 years ago
- Flink代码实例☆122Updated 4 years ago
- The real time project of storm for counting the pv and uv of a web site.☆34Updated 6 years ago
- 分布式数据仓库最佳实践☆57Updated 6 years ago
- log、event 、time 、window 、table、sql、connect、join、async IO、维表、CEP☆68Updated 2 years ago
- 基于Hadoop和HBase的大规模海量数据去重☆29Updated 6 years ago
- Flink 案例代码☆43Updated 2 years ago
- Learning Flink : Flink CEP,Flink Core,Flink SQL☆71Updated 3 years ago
- 《Spark大数据分析源码解析与实例详解》图书配套实例资源☆38Updated 2 years ago
- 一个基于Spring Boot的Storm开发手脚架,开箱即用!集成读写Kafka、写Redis、写MySQL示例。☆59Updated 6 years ago
- 关于 HDFS,Yarn,MapReduce,HBase,Hive,Pig,Sqoop,Flume,Zookeeper,MemCached,Redis,Storm,Scala,Spark,Flink 等大数据框架的学习笔记☆77Updated 5 years ago
- 【bigdata】spirngboot+spark 脚手架+相关实例☆22Updated 2 years ago
- elasticsearch reader and writer plugin for datax☆39Updated 7 years ago
- elasticsearch+hbase海量数据查询,支持千万数据秒回查询☆281Updated 8 years ago
- flink实时处理kafka传来的数据通过连接池技术写入hbase☆95Updated 2 years ago
- 学习 Spark 的一个小项目,以及其中各种调优的笔记☆177Updated 7 years ago
- 基于DataX的通用数据同步微服务,一个Restful接口搞定所有通用数据同步☆53Updated 2 years ago
- 同步Hive数据仓库数据到Elasticsearch的小工具☆21Updated 7 years ago
- Storm Kafka 流数据 处理系统☆20Updated 6 years ago
- 金融风控系统(springboot+drools)、flink流计算、mongodb☆154Updated 2 years ago
- [译] kudu 中文文档☆28Updated 3 years ago
- SparkSQL数据分析案例☆23Updated 8 years ago
- 大数据组件学习;包括dataflow,spring cloud stream;elasticsearch;flink;spark;kafka;phoenix;Hive;Hbase;☆23Updated 2 years ago
- 基于Java,封装了hbase的底层api,提供了基于注解的ORM支持,只需定义实体类对象,即可完成对hbase的各种操作。同时对List、Set、Map等复杂数据类型提供了支持☆43Updated 8 years ago
- 基于canal/kafka conenct的mysql/oracle数据实时同步、flink rest api、flink sql以及udf☆50Updated 2 years ago