cas-bigdatalab / piflowLinks
πflow is a big data flow engine with spark support
☆537Updated 9 months ago
Alternatives and similar repositories for piflow
Users that are interested in piflow are comparing it to the libraries listed below
Sorting:
- Moonbox is a DVtaaS (Data Virtualization as a Service) Platform☆508Updated 2 years ago
- Exchangis is a lightweight,highly extensible data exchange platform that supports data transmission between structured and unstructured h…☆451Updated 4 months ago
- Visualis is a BI tool for data visualization. It provides financial-grade data visualization capabilities on the basis of data security a…☆269Updated 8 months ago
- Schedulis is a high performance workflow task scheduling system that supports high availability and multi-tenant financial level features…☆391Updated 2 weeks ago
- Qualitis is a one-stop data quality management platform that supports quality verification, notification, and management for various data…☆750Updated 5 months ago
- Wormhole is a SPaaS (Stream Processing as a Service) Platform☆979Updated 2 years ago
- 如果你在从事大数据BI的工作,想对比一下MySQL、GreenPlum、Elasticsearch、Hive、Spark SQL、Presto、Impala、Drill、HAWQ、Druid、Pinot、Kylin、ClickHouse、Kudu等不同实现方案之间的表现,…☆285Updated 7 years ago
- WeDataSphere is a financial grade, one-stop big data platform suite.☆670Updated last year
- DataSphereStudio documents.☆122Updated 8 months ago
- Unified SQL Analytics Engine Based on SparkSQL☆212Updated 2 years ago
- Platform for Flink☆283Updated 2 years ago
- Open data platform based on Kubernetes. Scaleph supports SeaTunnel、Flink and Doris backended by SeaTunnel on Flink engine、Flink Kubernete…☆390Updated 2 months ago
- Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, res…☆812Updated 9 months ago
- DBus☆1,213Updated 2 years ago
- Stream computing platform for bigdata☆407Updated last year
- ☆568Updated last year
- 为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能☆144Updated 2 months ago
- 这是一个可自由拖拽的BI可视化 系统 支持主流的关系数据:MySQL,Oracle,PostgreSQL等 同时支持Apache Doris☆212Updated 3 years ago
- 给flink开发的web系统。支持页面上定义udf,进行sql和jar任务的提交;支持source、sink、job的管理;可以管理openshift上的flink集群☆286Updated 2 years ago
- Cluster manager for Apache Doris☆187Updated last year
- ☆63Updated 3 months ago
- An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)☆381Updated last year
- Spark、Flink等离线任务的调度以及实时任务的监控☆304Updated last year
- DataX 是阿里巴巴集团内被广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、HDFS、Hive、OceanBase、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。☆138Updated 3 years ago
- A prototype project of big data platform, the source codes of the book Big Data Platform Architecture and Prototype☆199Updated 5 years ago
- hera 分布式任务调度系统 大数据任务调度系统 任务调度 (数据部门专用)☆364Updated 2 years ago
- Tapdata Live Data Platform Project☆602Updated this week
- datax数据同步elasticsearch的reader和writer插件,支持一对多的扁平数据转换成es的嵌套对象,也支持嵌套对象的读取和ognl表达式过滤,理论上可以无限嵌套。☆88Updated 2 months ago
- TipDM建模平台,开源的数据挖掘工具。☆232Updated 2 years ago
- Know your data better!Datavines is Next-gen Data Observability Platform, support metadata manage and data quality.☆666Updated this week