The book of data warehouse
☆196Oct 13, 2022Updated 3 years ago
Alternatives and similar repositories for data-warehouse
Users that are interested in data-warehouse are comparing it to the libraries listed below
Sorting:
- ☆77Oct 22, 2018Updated 7 years ago
- A HBase datasource implementation for Spark and [MLSQL](http://www.mlsql.tech).☆15Sep 29, 2023Updated 2 years ago
- 论文阅读总结☆32Jun 13, 2019Updated 6 years ago
- hadoop各组件使用,持续更新☆901Jan 4, 2023Updated 3 years ago
- hive仓库元数据管理系统☆167Aug 3, 2016Updated 9 years ago
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,843May 29, 2024Updated last year
- 封装sparkstreaming动态调节batch time(有数据就执行计算); 支持运行过程中增删topic; 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。☆180Apr 15, 2021Updated 4 years ago
- An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)☆380Dec 16, 2023Updated 2 years ago
- Moonbox is a DVtaaS (Data Virtualization as a Service) Platform☆506Apr 14, 2023Updated 2 years ago
- sql解析和执行,能够执行hive, spark, flink, 以及对应对TensorFlow, Deeplearning4j的算法SQL执行☆11Sep 16, 2022Updated 3 years ago
- It is a assemble to include all Practice Projects about Big Data Topic, includes Hadoop, Spark, Spark Streaming and Kafka☆11Mar 7, 2019Updated 7 years ago
- mysql2hiveql is a tool to convert MySQL queries into Hive queries (HiveQL)☆14Aug 30, 2012Updated 13 years ago
- 大数据【企业级360°全方位用户画像】标签开发部分源码☆19Dec 18, 2020Updated 5 years ago
- 电商平台数据仓库搭建☆138Jan 29, 2025Updated last year
- 一个手动管理spark streaming集成kafka时的偏移量到zookeeper中的小项目☆133Dec 17, 2025Updated 3 months ago
- This is a library for SQL optimizing/rewriting including Materialized View rewrite☆69Jun 21, 2022Updated 3 years ago
- A data integration framework☆4,109Dec 2, 2025Updated 3 months ago
- ☆22Feb 24, 2020Updated 6 years ago
- DBus☆1,212Dec 6, 2022Updated 3 years ago
- 基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法☆2,062Feb 21, 2024Updated 2 years ago
- Wormhole is a SPaaS (Stream Processing as a Service) Platform☆976Nov 16, 2022Updated 3 years ago
- Flink 中文视频课程(持续更新...)☆4,626Jun 18, 2020Updated 5 years ago
- 酷玩 Spark: Spark 源代码解析、Spark 类库等☆3,483May 18, 2022Updated 3 years ago
- flink学习笔记,包含DataSet、DataStream、Window、缓存、Source、Sink相关说明、水印及示例代码☆12Jul 22, 2023Updated 2 years ago
- 从数据仓库到用户画像,从数据建设到数据应用☆626Jan 26, 2022Updated 4 years ago
- flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Ta…☆15,058Mar 9, 2026Updated last week
- A Spark Atlas connector to track data lineage in Apache Atlas☆266Nov 16, 2022Updated 3 years ago
- ☆235Sep 15, 2022Updated 3 years ago
- Real-Time Analysis Integration with Kafka in Apache Spark’s Structured Streaming☆58Mar 24, 2018Updated 7 years ago
- 剥离的模块,用于查看Spark SQL生成的语法树☆91May 26, 2019Updated 6 years ago
- 使用spark + kudu的案例☆15Sep 13, 2017Updated 8 years ago
- Flume Source to import data from SQL Databases☆266Dec 3, 2020Updated 5 years ago
- My Blog☆76May 3, 2018Updated 7 years ago
- Yet-Another-Rules-Engine -- A easy-to-understand Business Readable DSL for defining production rules.☆14Mar 24, 2021Updated 4 years ago
- Mirror of Apache griffin☆1,171Aug 3, 2025Updated 7 months ago
- Kibana Export Spy Plugin (JSON, CSV, XML)☆10Aug 30, 2016Updated 9 years ago
- Spark Streaming监控平台,支持任务部署与告警、自启动☆129Mar 29, 2018Updated 7 years ago
- 如果你在从事大数据BI的工作,想对比一下MySQL、GreenPlum、Elasticsearch、Hive、Spark SQL、Presto、Impala、Drill、HAWQ、Druid、Pinot、Kylin、ClickHouse、Kudu等不同实现方案之间的表现,…☆285May 24, 2018Updated 7 years ago
- ☆22Jun 21, 2022Updated 3 years ago