dantezhao / data-groupLinks
☆77Updated 7 years ago
Alternatives and similar repositories for data-group
Users that are interested in data-group are comparing it to the libraries listed below
Sorting:
- azkaban小助手,增加任务web配置、远程脚本调用、报警扩展、跨项目依赖等功能。☆117Updated 8 years ago
- mysql数据实时增量导入hive☆87Updated 8 years ago
- Unified SQL Analytics Engine Based on SparkSQL☆212Updated 3 years ago
- 2017年易观olap大赛☆119Updated 7 years ago
- My Blog☆76Updated 7 years ago
- ☆133Updated 8 years ago
- ☆236Updated 3 years ago
- spark example code, has some production practice.☆178Updated 9 years ago
- Some useful custom hive udf functions, especial array, json, math, string functions.☆227Updated last year
- Ctrip Hadoop Job Scheduling System derived from https://github.com/alibaba/zeus☆159Updated 9 years ago
- spark实例代码☆78Updated 8 years ago
- Stream computing platform for bigdata☆407Updated last year
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆152Updated 2 years ago
- An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)☆382Updated 2 years ago
- 如果你在从事大数据BI的工作,想对比一下MySQL、GreenPlum、Elasticsearch、Hive、Spark SQL、Presto、Impala、Drill、HAWQ、Druid、Pinot、Kylin、ClickHouse、Kudu等不同实现方案之间的表现,…☆286Updated 7 years ago
- presto hbase connector 组件基于Presto Connector接口规范实现,用来给Presto增加查询HBase的功能。相比其他开源版本的HBase Connector,我们的性能要快10到100倍以上。☆243Updated 3 years ago
- The book of data warehouse☆196Updated 3 years ago
- 封装sparkstreaming动态调节batch time(有数据就执行计算); 支持运行过程中增删topic; 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。☆181Updated 4 years ago
- spark-summit-north-america-2018-06, More detail please visit☆111Updated 7 years ago
- ☆135Updated 7 years ago
- hbase-tools try easy to use and test the hbase,☆46Updated 11 years ago
- Spark源码剖析☆86Updated 8 years ago
- Spark Streaming,Kafka and HBase code accompanying the blog 'Offset Management For Apache Kafka With Apache Spark Streaming'☆23Updated 8 years ago
- Flink 官方文档中文翻译项目☆382Updated last year
- ☆43Updated 6 years ago
- Spark 编程指南简体中文版☆192Updated 2 years ago
- 为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能☆146Updated 2 months ago
- A library based on delta for Spark and MLSQL☆61Updated 5 years ago
- ☆131Updated 7 years ago
- ☆126Updated last week