lqkweb / sqlflowLinks
SQLflow based on python development, support to Spark, as the underlying distributed computing engine, through a set of unified configuration file to complete the batch, flow calculation, the Rest service development.
☆133Updated 2 years ago
Alternatives and similar repositories for sqlflow
Users that are interested in sqlflow are comparing it to the libraries listed below
Sorting:
- ☆42Updated 6 years ago
- ☆28Updated 8 years ago
- azkaban小助手,增加任务web配置、远程脚本调用、报警扩展、跨项目依赖等功能。☆117Updated 8 years ago
- mysql数据实时增量导入hive☆87Updated 8 years ago
- ☆77Updated 6 years ago
- 如果你在从事大数据BI的工作,想对比一下MySQL、GreenPlum、Elasticsearch、Hive、Spark SQL、Presto、Impala、Drill、HAWQ、Druid、Pinot、Kylin、ClickHouse、Kudu等不同实现方案之间的表现,…☆283Updated 7 years ago
- Data quality check tools by execute sql☆21Updated 7 years ago
- SparkSQL数据分析案例☆23Updated 8 years ago
- 流程化 机器学习框架 基于 scala java语言 ,一站式自动机器学习平台 ,主要包括数据分析 特征工程 ,机器模型,自动部署,超参数优化,模型自动优化,自动扩容分配创建功能,类似第四范式、阿里PAI平台、google autoMl、亚马逊SageMaker☆65Updated 6 years ago
- 分布式数据仓库最佳实践☆57Updated 7 years ago
- [译] Airflow 中文文档☆213Updated last year
- 同步Hive数据仓库数据到Elasticsearch的小工具☆21Updated 7 years ago
- example☆66Updated 5 years ago
- spark算子使用例子, spark RDD的算子挺多,有时候如何灵活的使用,该如何用一下子想不起来,这一段时间将spark的算子如何使用的例子给记录了下来,下面是spark RDD 的一些常用算子的使用 这些算子包括有java的,也有scala的语言(博客中才有),由于精…☆36Updated 6 years ago
- The Data Processer☆95Updated 8 years ago
- Apache Storm 官方文档中文版☆143Updated 4 years ago
- 睿思BI-OLAP开源多维分析系统☆110Updated 7 years ago
- 基于CDH5.x parcles安装,一键卸载脚本☆38Updated 2 years ago
- 写点一路的风景,都很普通,主要还是留给自己。请访问:http://guzhenping.com☆22Updated 5 years ago
- Unified SQL Analytics Engine Based on SparkSQL☆210Updated 2 years ago
- Apache Kylin Python Client Library☆63Updated 2 years ago
- Ambari集成Apache Kylin服务(离线部署 、可支持HDP2.6+及HDP3.0+)☆37Updated 4 years ago
- hive仓库元数据管理系统☆166Updated 8 years ago
- 数据治理、数据质量检核/监控平台(Django+jQuery+MySQL)☆186Updated 2 years ago
- kudu学习的一些资料,以及和spark/impala的集成使用☆33Updated 7 years ago
- Spark Streaming监控平台,支持任务部署与告警、自启动☆128Updated 7 years ago
- A distributed data factory, providing data access, etl, scheduling. Easily manage tasks such as hive, spark, clickhouse, flink, shell, py…☆32Updated 3 years ago
- 大数据平台相关代码(ES/Hive/Hadoop/hdfs/hbase)☆75Updated 2 years ago
- Flink 案例代码☆43Updated 3 years ago
- 大数据组件 All-in-One 的 Dockerfile☆95Updated 7 months ago