chucheng92 / HadoopDedup
基于Hadoop和HBase的大规模海量数据去重
☆29Updated 6 years ago
Alternatives and similar repositories for HadoopDedup:
Users that are interested in HadoopDedup are comparing it to the libraries listed below
- 大数据平台相关代码(ES/Hive/Hadoop/hdfs/hbase)☆74Updated 2 years ago
- hadoop flume hbase kafka storm;读取kafka数据=》storm实时处理(分割字符,统计字符)=》写入hdfs☆21Updated 6 years ago
- 使用Storm实时处理交通大数据(数据源:kafka,集群管理:zookeeper)☆51Updated 2 years ago
- Storm Kafka 流数据 处理系统☆20Updated 6 years ago
- tools for bigData☆37Updated 6 years ago
- phoenix 操作hbase和springboot 的整合☆11Updated 7 years ago
- kafka spark hbase 日志统计☆79Updated 8 years ago
- 大数据组件学习;包括dataflow,spring cloud stream;elasticsearch;flink;spark;kafka;phoenix;Hive;Hbase;☆23Updated 2 years ago
- 论坛日志分析系统清洗程序(包含IP规则库,UDF开发,MapReduce程序,日志数据)☆33Updated 6 years ago
- Streaming 相关项目☆15Updated 7 years ago
- SparkStreaming项目,显示flume->Kafka->Spark->hbase(实时数据处理方案),Scala实现☆36Updated 6 years ago
- 基于DataX的通用数据同步微服务,一个Restful接口搞定所有通用数据同步☆53Updated 2 years ago
- SparkSQL数据分析案例☆23Updated 8 years ago
- spring+spark streaming+kafka 10版本集成和异常问题处理☆17Updated 7 years ago
- The real time project of storm for counting the pv and uv of a web site.☆34Updated 6 years ago
- 基于Flink流处理的动态实时亿级全端用户画像系统可视化界面☆34Updated 2 years ago
- Use Scala API to read/write data from different databases,HBase,MySQL,etc.☆24Updated 6 years ago
- 使用spark对hive、hbase、ES的读写, 实现一次配置可对不同数据库进行导入导出,并对ES、hbase进行封装☆32Updated 7 years ago
- 【bigdata】spirngboot+spark 脚手架+相关实例☆22Updated 2 years ago
- hbase+solr实现hbase的二级索引☆48Updated 3 years ago
- Spark Sql进行离线日志分析,Java Web+Echarts+Ajax进行数据可视化展示☆27Updated 6 years ago
- 基于canal/kafka conenct的mysql/oracle数据实时同步、flink rest api、flink sql以及udf☆50Updated 2 years ago
- The project of data cleaning and data analysis based on MapReduce.☆62Updated 6 years ago
- 一个基于Spring Boot的Storm开发手脚架,开箱即用!集成读写Kafka、写Redis、写MySQL示例。☆59Updated 6 years ago
- 通过HBase Observer同步数据到ElasticSearch☆55Updated 9 years ago
- Spark、Hadoop、Flink、Storm、Kafka编程实例学习☆168Updated 7 years ago
- 日志分析器,仿造elk中logstash的简单Java实现,实现监控目录日志,自动解析存入elasticsearch。☆22Updated 8 years ago
- 基于spark-ml,spark-mllib,spark-streaming的推荐算法实现☆96Updated 5 years ago
- flink rest api的spring-boot-starter☆17Updated last year
- flink sql☆11Updated 2 years ago