hj2016 / hudi-test
☆13Updated 4 months ago
Alternatives and similar repositories for hudi-test:
Users who are interested in hudi-test are comparing it to the repositories listed below
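- Structured Streaming implemented with SQL☆39Updated 6 years ago
- Storing Kafka offsets in MySQL from Spark Streaming to guarantee zero data loss☆45Updated 7 years ago
- Spark loads HDFS data into Kafka at high throughput, then Spark Streaming/Structured Streaming consumes it at high speed; focused on performance, and suggestions for performance/code optimization are welcome☆33Updated 5 years ago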
- My Blog☆76Updated 6 years ago
- Java library to integrate Flink and Kudu☆54Updated 7 years ago
- Presto source code analysis☆51Updated 6 years ago
- A custom Hint optimizer for SparkSQL that addresses JOIN data skew caused by hot-spot data☆48Updated 6 years ago
- A Spark scaffolding project that standardizes the Spark development, deployment, and testing workflow.☆93Updated 3 months ago
- A library for querying Binlog with Apache Spark Structured Streaming, for Spark SQL, DataFrames and [MLSQL](https://www.mlsql.tech).☆154Updated last year
- This is a library for SQL optimizing/rewriting including Materialized View rewrite☆69Updated 2 years ago
- A playground for Spark jobs.☆44Updated 6 years ago
- An extracted standalone module for viewing the syntax tree generated by Spark SQL☆90Updated 5 years ago
- A collection of Apache Hudi demos to help beginners get started with Apache Hudi quickly☆74Updated 4 years ago
- Flink SQL tutorial☆34Updated last month
- A library based on delta for Spark and MLSQL☆61Updated 4 years ago
- spark-scala-maven☆58Updated 6 years ago
- flink-sql: a platform for running SQL and building data streams on Flink, based on Apache Flink 1.10.0☆110Updated 2 years ago
- Encapsulated APIs for combining Spark with other components for ease of use, e.g. ES, HBase, Kudu, Kafka, MQ☆35Updated 5 years ago
- ☆118Updated last year
- Tracks field lineage in Spark SQL☆20Updated 2 months ago
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Updated 7 years ago
- Spark source code analysis☆87Updated 7 years ago
- ☆15Updated 10 months ago
- Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org☆36Updated 7 years ago
- Already merged into apache/incubator-kyuubi. ACL Management for Apache Spark SQL with Apache Ranger.☆54Updated 3 years ago
- DirectKafka examples for Spark Streaming: 1. With checkpointing 2. Custom offset management☆60Updated 8 years ago
- Hive hook, obtain task information from Hive, fetch input/output tables and lineage information from HSQL.☆39Updated last year
- Data lineage for Hive/Sqoop/HBase/Spark, etc.: after the data is sent to Kafka, it is parsed and processed, and the lineage is generated with Neo4j☆81Updated 3 years ago
- Apache CarbonData Learning☆53Updated 4 years ago
- SQL parsing tool. Mainly parses Hive SQL, Spark SQL, and Presto SQL, extracting input tables, output tables, fields, and other information from the SQL☆94Updated last year