terry-chelsea / bigdataLinks
☆25Updated 8 years ago
Alternatives and similar repositories for bigdata
Users that are interested in bigdata are comparing it to the libraries listed below
Sorting:
- 对yarn的的RM,NM模块代码进行分析☆49Updated 7 years ago
- kudu学习的一些资料,以及和spark/impala的集成使用☆33Updated 8 years ago
- My Blog☆76Updated 7 years ago
- Serviceframework一个简单但灵活的模块引擎☆31Updated 8 years ago
- Apache CarbonData 源码阅读☆62Updated 5 years ago
- ☆75Updated 12 years ago
- 通过HBase Observer同步数据到ElasticSearch☆54Updated 10 years ago
- presto 源码分析☆51Updated 7 years ago
- Flink Forward 2017-04-10 &11 ppt☆57Updated 8 years ago
- ☆131Updated 6 years ago
- work flow schedule☆91Updated 8 years ago
- SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失☆44Updated 8 years ago
- ☆91Updated 2 years ago
- Distributed SQL query engine for big data☆55Updated 11 years ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆152Updated 2 years ago
- 基于Yarn的容器调度引擎(container scheduler based on yarn)☆36Updated 9 years ago
- presto hbase connector 组件基于Presto Connector接口规范实现,用来给Presto增加查询HBase的功能。相比其他开源版本的HBase Connector,我们的性能要快10到100倍以上。☆242Updated 2 years ago
- mysql数据实时增量导入hive☆87Updated 8 years ago
- ☆236Updated 3 years ago
- spark-scala-maven☆59Updated 6 years ago
- This is a library for SQL optimizing/rewriting including Materialized View rewrite☆69Updated 3 years ago
- A library based on delta for Spark and MLSQL☆61Updated 4 years ago
- Spark源码剖析☆87Updated 7 years ago
- A high performance in-memory hive sql engine based on Apache Calcite☆192Updated 3 years ago
- Ctrip Hadoop Job Scheduling System derived from https://github.com/alibaba/zeus☆158Updated 9 years ago
- Spark Streaming监控平台,支持任务部署与告警、自启动☆129Updated 7 years ago
- flink技术学习笔记分享☆81Updated 6 years ago
- Guardian of Waterdrop and Spark☆30Updated 2 years ago
- Java library to integrate Flink and Kudu☆55Updated 8 years ago
- akkaflow是一个基于akka架构上构建的分布式高可用DAG工作流调度工具,可以把子节点分配在集群机器上并行执行,高效利用集群资源。☆108Updated 6 years ago