hj2016 / hudi-test
☆13Updated 6 months ago
Alternatives and similar repositories for hudi-test:
Users who are interested in hudi-test are comparing it to the libraries listed below.
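- Implementing Structured Streaming with SQL☆39Updated 6 years ago
- Spark loads HDFS data into Kafka at high throughput, then Spark Streaming/Structured Streaming consumes it at high speed; the focus is on performance, and suggestions for performance or code optimization are welcome☆33Updated 6 years ago
- Using MySQL to store Kafka offsets in Spark Streaming to guarantee zero data loss☆45Updated 7 years ago
- Presto source code analysis☆51Updated 7 years ago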
- My Blog☆76Updated 6 years ago
- Custom data source for Spark Structured Streaming☆12Updated 6 years ago
- ☆33Updated 5 years ago
- Spark scaffolding project that standardizes the Spark development, deployment, and testing workflow.☆93Updated 5 months ago
- A collection of Apache Hudi demos to help beginners get started quickly with Apache Hudi☆74Updated 4 years ago
- A playground for Spark jobs.☆44Updated 6 years ago
- A library based on delta for Spark and MLSQL☆61Updated 4 years ago
- A library for querying Binlog with Apache Spark Structured Streaming, for Spark SQL, DataFrames and [MLSQL](https://www.mlsql.tech).☆153Updated last year
- Java library to integrate Flink and Kudu☆54Updated 7 years ago
- Spark source code analysis☆87Updated 7 years ago
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Updated 8 years ago
- Encapsulated APIs for integrating Spark with other components for ease of use, e.g. ES, HBase, Kudu, Kafka, MQ☆35Updated 5 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆31Updated 6 years ago
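- Spark Structured Streaming consuming Kafka data and writing it to HBase☆33Updated 6 years ago
- Data lineage for Hive/Sqoop/HBase/Spark and others: events are sent to Kafka, then parsed and processed, with neo4j used to generate the lineage☆81Updated 3 years ago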
- This is a library for SQL optimizing/rewriting including Materialized View rewrite☆69Updated 2 years ago
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Updated 7 years ago
- ☆106Updated last year
- ☆118Updated last year
- spark-scala-maven☆58Updated 6 years ago
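- Already merged into apache/incubator-kyuubi: ACL Management for Apache Spark SQL with Apache Ranger.☆54Updated 3 years ago
- Based on Flink 1.9.1, a standalone SDK implementation of the flink-sql-client module that supports remote submission of SQL jobs to a Yarn cluster, enabling remote execution of Flink SQL jobs☆48Updated 2 years ago
- Code repository for the Flink 菜鸟 WeChat official account☆64Updated 3 months ago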
- Hive hook, obtain task information from Hive, fetch input/output tables and lineage information from HSQL.☆39Updated last year
- Flink SQL tutorial☆34Updated 3 months ago
- Apache CarbonData Learning☆53Updated 5 years ago