lihuigang / hive-bitmap-udf
在hive中使用Roaring64Bitmap实现精确去重功能
☆69Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for hive-bitmap-udf
- sql解析工具。主要解析hive sql、spark sql、presto sql。从sql中解析出输入表、输出表以及字段等信息☆94Updated last year
- Spark-2.3.1源码解读☆198Updated last year
- Compass is a task diagnosis platform for bigdata☆358Updated 2 months ago
- The Lineage Analysis system for FlinkSQL supports advanced syntax such as Watermark, UDTF, CEP, Windowing TVFs, and CTAS.☆369Updated 5 months ago
- Hive hook, obtain task information from Hive, fetch input/output tables and lineage information from HSQL.☆39Updated last year
- ☆443Updated 2 years ago
- Using Flink SQL to build ETL job☆200Updated last year
- Asynchronous flink connector based on the Lettuce, supporting sql join and sink, query caching and debugging.☆218Updated this week
- An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)☆385Updated 10 months ago
- 基于 Flink 的 sqlSubmit 程序☆144Updated 8 months ago
- 从本地IDEA提交Flink/Spark任务到Yarn/k8s集群☆162Updated 3 years ago
- 本 GitHub 项目是 Flink Forward Asia Hackathon (2021) 的投票专用项目。☆121Updated 2 years ago
- ☆195Updated 2 weeks ago
- A simple project used to submit a Flink SQL script☆372Updated 5 years ago
- 汇总Apache Hudi相关资料☆537Updated this week
- Flink, algorithm and Java learning code, I hope it will be useful to you.☆67Updated last month
- presto hbase connector 组件基于Presto Connector接口规范实现,用来给Presto增加查询HBase的功能。相比其他开源版本的HBase Connector,我们的性能要快10到100倍以上。☆240Updated last year
- Platform for Flink☆283Updated last year
- Stream computing platform for bigdata☆404Updated 6 months ago
- For every learner☆309Updated 11 months ago
- Spark 脚手架工程,标准化 spark 开发、部署、测试流程。☆93Updated last month
- ☆21Updated 2 years ago
- spark 字段血缘 spark field lineage☆32Updated 2 years ago
- ☆490Updated 2 years ago
- Flink Connector for Apache Doris☆326Updated this week
- ☆117Updated last year
- 基于antlr4的sql解析,实现格式化,元数据,血源等自定义解析,包括hive☆107Updated last year
- 基于Apache-bahir-kudu-connector的flink-connector-kudu,支持Flink1.11.x DynamicTableSource/Sink,支持Range分区等☆45Updated last year
- FlinkSQL数据脱敏和行级权限解决方案及源码,支持面向用户级别的数据脱敏和行级数据访问控制,即特定用户只能访问到脱敏后的数据或授权过的行。此方案是实时领域Flink的解决方案,类似于离线数仓Hive Ranger中的Row-level Filter和Column Mas…☆121Updated last year
- ☆292Updated 2 years ago
- 一个手动管理spark streaming集成kafka时的偏移量到zookeeper中的小项目☆136Updated 3 months ago