huangfox / dpkb
大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse
☆229Updated 3 months ago
Alternatives and similar repositories for dpkb:
Users that are interested in dpkb are comparing it to the libraries listed below
- Flink源码阅读分享,不断记录Flink源码的阅读过程☆90Updated 5 months ago
- presto、trino资料分享,开发文档、源码阅读、二次开发。☆61Updated 2 months ago
- 汇总Apache Hudi相关资料☆549Updated 2 weeks ago
- Test code for apache calcite☆213Updated 2 years ago
- 本 GitHub 项目是 Flink Forward Asia Hackathon (2021) 的投票专用项目。☆122Updated 3 years ago
- ☆189Updated 3 years ago
- OLAP Database Performance Tuning Guide☆372Updated last year
- Apache Flink 源码分析系列,基于 git tag 1.1.2☆229Updated 8 years ago
- Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.☆255Updated 10 months ago
- 汇总Apache Iceberg相关的最新文章、资料以及Demo等☆32Updated 3 years ago
- ☆200Updated 2 months ago
- Compass is a task diagnosis platform for bigdata☆377Updated 4 months ago
- The Lineage Analysis system for FlinkSQL supports advanced syntax such as Watermark, UDTF, CEP, Windowing TVFs, and CTAS.☆379Updated 10 months ago
- 从本地IDEA提交Flink/Spark任务到Yarn/k8s集群☆162Updated 3 years ago
- flink 流处理源码分析☆75Updated 5 years ago
- Spark-2.3.1源码解读☆198Updated 2 years ago
- 基于antlr4的sql解析,实现格式化,元数据,血源等自定义解析,包括hive☆111Updated 2 years ago
- calcite的相关联系代码,包含CSV适配器,使用CSV适配器来进行SQL查询。SQL的parse和validate,以及RBO和CBO的使用。☆71Updated 4 years ago
- Remote Shuffle Service for Flink☆189Updated 2 years ago
- Platform for Flink☆281Updated 2 years ago
- FlinkSQL数据脱敏和行级权限解决方案及源码,支持面向用户级别的数据脱敏和行级数据访问控制,即特定用户只能访问到脱敏后的数据或授权过的行。此方案是实时领域Flink的解决方案,类似于离线数仓Hive Ranger中的Row-level Filter和Column Mas…☆134Updated last year
- 剥离的模块,用于查看Spark SQL生成的语法树☆92Updated 5 years ago
- https://blog.csdn.net/QXC1281/article/details/89070285☆538Updated 2 years ago
- sql解析工具。主要解析hive sql、spark sql、presto sql。从sql中解析出输入表、输出表以及字段等信息☆94Updated last year
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆255Updated last year
- For every learner☆310Updated last year
- 分享一些在工作中的大数据实战案例,包括flink、kafka、hadoop、presto等等。欢迎大家关注我的公众号【大数据技术与应用实战】,一起成长。☆263Updated last year
- Uniffle is a high performance, general purpose Remote Shuffle Service.☆409Updated this week
- Benchmarks for Apache Flink☆175Updated last month
- 汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)☆74Updated 4 years ago