xpleaf / data-extract-clean-analysisLinks
The project of data cleaning and data analysis based on MapReduce.
☆62Updated 7 years ago
Alternatives and similar repositories for data-extract-clean-analysis
Users that are interested in data-extract-clean-analysis are comparing it to the libraries listed below
Sorting:
- kafka spark hbase 日志统计☆79Updated 8 years ago
- 大数据平台相关代码(ES/Hive/Hadoop/hdfs/hbase)☆75Updated 2 years ago
- Spark、Hadoop、Flink、Storm、Kafka编程实例学习☆168Updated 8 years ago
- The real time project of storm for counting the pv and uv of a web site.☆34Updated 6 years ago
- Spark Streaming监控平台,支持任务部署与告警、自启动☆128Updated 7 years ago
- 本项目记录我学习hadoop和spark等开源框架的代码,因为也是最近才用github,之前都是荒废状态,故部分都是是之前写好的,现在上传至github☆86Updated 7 years ago
- A web app for the storm-statistic project.☆19Updated 7 years ago
- 基于Hadoop和HBase的大规模海量数据去重☆29Updated 7 years ago
- 分布式数据仓库最佳实践☆57Updated 7 years ago
- spark全示例代码(java、scala) Spark most full instance code DEMO (java、scala)☆83Updated 5 years ago
- tools for bigData☆37Updated 6 years ago
- winutils and hadoop lib for spark on windows_X64☆36Updated 8 years ago
- Flink代码实例☆122Updated 4 years ago
- 基于spark、mahout和spring boot构建的推荐系统☆131Updated last month
- 基于canal/kafka conenct的mysql/oracle数据实时同步、flink rest api、flink sql以及udf☆50Updated 2 years ago
- High Performance Spark Streaming with Direct Kafka in Java☆39Updated 8 years ago
- 基于spark streaming和kafka,hbase的日志统计分析系统☆262Updated 7 years ago
- hadoop_storm_spark结合实验的例子,模拟淘宝双11节,根据订单详细信息,汇总出总销售量,各个省份销售排行,以及后期的SQL分析,数据分析,数据挖掘等。 --------大概流程------- 第一阶段(storm实时报表) 第二阶段(离线报表)第三阶段(大规…☆322Updated 10 years ago
- bigdata note☆39Updated last year
- SparkSQL数据分析案例☆23Updated 8 years ago
- kafka传数据到Flink存储到mysql之Flink使用SQL语句聚合数据流(设置时间窗口,EventTime)☆32Updated 7 years ago
- elasticsearch+hbase海量数据查询,支持千万数据秒回查询☆281Updated 8 years ago
- Flink 案例代码☆43Updated 2 years ago
- 收集应用程序的log统一发送到kafka中☆30Updated last year
- Streaming 相关项目☆15Updated 8 years ago
- hadoop flume hbase kafka storm;读取kafka数据=》storm实时处理(分割字符,统计字符)=》写入hdfs☆21Updated 6 years ago
- 一个基于Spring Boot的Storm开发手脚架,开箱即用!集成读写Kafka、写Redis、写MySQL示例。☆59Updated 6 years ago
- poseidonX 是一个基于jstorm和flink的一体化实时计算服务平台☆55Updated 6 years ago
- flink简易使用教程,结合官方仓库的example样例,结合常见场景,使用flink的基本功能☆114Updated 2 years ago
- ☆38Updated 8 years ago