Shockang / spark-examples
致力于提供最具实践性的 Spark 代码开发学习指南
☆11Updated 2 years ago
Alternatives and similar repositories for spark-examples:
Users that are interested in spark-examples are comparing it to the libraries listed below
- SparkSQL自定义Hint优化器解决热点数据导致JOIN数据倾斜问题☆48Updated 6 years ago
- 剥离的模块,用于查看Spark SQL生成的语法树☆90Updated 5 years ago
- ☆15Updated 10 months ago
- This is a library for SQL optimizing/rewriting including Materialized View rewrite☆69Updated 2 years ago
- Spark源码剖析☆87Updated 7 years ago
- My Blog☆76Updated 6 years ago
- spark-scala-maven☆58Updated 6 years ago
- A playground for Spark jobs.☆44Updated 6 years ago
- A library based on delta for Spark and MLSQL☆61Updated 4 years ago
- spark,hive,等SQL解析案例☆40Updated 6 years ago
- Apache CarbonData Learning☆53Updated 4 years ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆154Updated last year
- Spark 脚手架工程,标准化 spark 开发、部署、测试流程。☆93Updated 3 months ago
- sql解析和执行,能够执行hive, spark, flink, 以及对应对TensorFlow, Deeplearning4j的算法SQL执行☆11Updated 2 years ago
- 跟踪Spark-sql中的字段血缘关系☆20Updated 2 months ago
- A RPC framework leveraging Spark RPC module☆211Updated 5 years ago
- 汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)☆74Updated 4 years ago
- TPC-DS Performance tests tool for Flink☆29Updated 3 years ago
- presto 源码分析☆51Updated 6 years ago
- ☆11Updated 2 years ago
- Custom datasource about spark structure streaming☆13Updated 5 years ago
- Spark Streaming,Kafka and HBase code accompanying the blog 'Offset Management For Apache Kafka With Apache Spark Streaming'☆23Updated 7 years ago
- A New Way of Data Lake☆48Updated 3 years ago
- ☆76Updated 11 years ago
- calcite的相关联系代码,包含CSV适配器,使用CSV适配器来进行SQL查询。SQL的parse和validate,以及RBO和CBO的使用。☆70Updated 4 years ago
- Playground for Flink Table Store with use cases and performance features☆48Updated last year
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆54Updated 3 years ago
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Updated 7 years ago
- Spark源代码中文注释☆42Updated 6 years ago
- SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失☆45Updated 7 years ago