cas-bigdatalab / piflow
πflow is a big data flow engine with Spark support
☆535 Updated 6 months ago
Alternatives and similar repositories for piflow
Users who are interested in piflow are comparing it to the repositories listed below
- Moonbox is a DVtaaS (Data Virtualization as a Service) Platform ☆506 Updated 2 years ago
- Visualis is a BI tool for data visualization. It provides financial-grade data visualization capabilities on the basis of data security a… ☆263 Updated 5 months ago
- Qualitis is a one-stop data quality management platform that supports quality verification, notification, and management for various data… ☆742 Updated 2 months ago
- Wormhole is a SPaaS (Stream Processing as a Service) Platform ☆977 Updated 2 years ago
- Exchangis is a lightweight, highly extensible data exchange platform that supports data transmission between structured and unstructured h… ☆448 Updated 2 months ago
- Schedulis is a high performance workflow task scheduling system that supports high availability and multi-tenant financial level features… ☆391 Updated 3 weeks ago
- Platform for Flink ☆281 Updated 2 years ago
- Unified SQL Analytics Engine Based on SparkSQL ☆210 Updated 2 years ago
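- hera: a distributed task scheduling system for big data task scheduling (for data departments) ☆361 Updated last year
- If you work in big data BI and want to compare the performance of different implementations such as MySQL, GreenPlum, Elasticsearch, Hive, Spark SQL, Presto, Impala, Drill, HAWQ, Druid, Pinot, Kylin, ClickHouse, and Kudu,… ☆283 Updated 7 years ago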
- Open data platform based on Kubernetes. Scaleph supports SeaTunnel, Flink, and Doris, backed by SeaTunnel on Flink engine, Flink Kubernete… ☆383 Updated last week
- Know your data better! Datavines is a next-gen data observability platform that supports metadata management and data quality. ☆627 Updated last month
- DBus ☆1,211 Updated 2 years ago
- Stream computing platform for big data ☆403 Updated last year
- Scheduling of offline tasks such as Spark and Flink, and monitoring of real-time tasks ☆303 Updated last year
- An ad hoc query service based on the Spark SQL engine. ☆381 Updated last year
- A Flexible, Fast, Federated (3F) SQL Analysis Middleware for Multiple Data Sources ☆2,059 Updated 2 years ago
- The next generation of cloud-native big data management expert, aiming to help users rapidly build stable, efficient, and scalable cloud-n… ☆1,224 Updated 10 months ago
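- An antlr4-based SQL parser for multiple databases that extracts metadata from SQL and can be used in many data platform scenarios: extracting metadata from DDL statements, SQL permission checks, table-level lineage, SQL syntax validation, and so on. Supports Spark, Flink, Gauss, StarRocks, Oracle, MySQL, Postgresq… ☆353 Updated last week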
- Scriptis is for interactive data analysis with script development (SQL, Pyspark, HiveQL), task submission (Spark, Hive), UDF, function, res… ☆811 Updated 6 months ago
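- A BI visualization system with free drag-and-drop. Supports mainstream relational databases such as MySQL, Oracle, and PostgreSQL, as well as Apache Doris ☆205 Updated 2 years ago
- Usage of the various Hadoop components, continuously updated ☆902 Updated 2 years ago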
- DataSphereStudio documents. ☆118 Updated 5 months ago
- A web system built for Flink. Supports defining UDFs in the UI and submitting SQL and JAR tasks; supports managing sources, sinks, and jobs; can manage Flink clusters on OpenShift ☆284 Updated 2 years ago
- ☆566 Updated last year
- dataService platform is a low-code platform that requires only writing SQL to develop API services, solve the unificat… ☆111 Updated last year
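- Provides remote multi-language invocation (ThriftServer, HttpServer) and distributed execution (DataX on YARN) for DataX (https://github.com/alibaba/DataX) ☆144 Updated last month
- The presto hbase connector component is implemented according to the Presto Connector interface specification and adds HBase querying to Presto. Compared with other open-source HBase connectors, our performance is more than 10 to 100 times faster. ☆241 Updated 2 years ago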
- ☆491 Updated 2 years ago
- Data governance and data quality checking/monitoring platform (Django + jQuery + MySQL) ☆186 Updated 2 years ago