NextMark / datashops
A distributed data factory, providing data access, etl, scheduling. Easily manage tasks such as hive, spark, clickhouse, flink, shell, python, mysql etc.
☆32Updated 2 years ago
Alternatives and similar repositories for datashops:
Users that are interested in datashops are comparing it to the libraries listed below
- ☆61Updated 2 months ago
- 此项目主要应用于数据中台或数据平台的数据总线,支持直接实时监听MySQL、MongoDB、PostgreSQL、Oracle、SQL Server、Db2和Cassandra等数据库的数据变更。☆62Updated last year
- 数据血缘,Hive/Sqoop/HBase/Spark等,发送到kafka后,解析处理使用neo4j生成血缘☆81Updated 3 years ago
- 基于DataX的通用数据同步微服务,一个Restful接口搞定所有通用数据同步☆53Updated 2 years ago
- 数据采集平台zdh,etl 处理服务☆70Updated 2 weeks ago
- Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-an…☆105Updated 2 months ago
- 基于canal/kafka conenct的mysql/oracle数据实时同步、flink rest api、flink sql以及udf☆50Updated 2 years ago
- flink endpoint for open world☆26Updated last year
- 针对datax进行2次开发,实现data 以rpc的方式传递json配置调用推数服务,同时修复datax多处bug。项目中也引入nacos作为服务的配置中心和注册中心; 同时项目内扩展了kafkawriter,rabbitmqwriter,esreader,hiveread…☆68Updated 2 years ago
- Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to cons…☆78Updated last year
- sql 血缘解析(hive sql、spark sql、starrocks sql、doris sql)☆21Updated 2 years ago
- 内嵌AI的数据质量控制系统☆45Updated 3 years ago
- Learning Flink : Flink CEP,Flink Core,Flink SQL☆71Updated 3 years ago
- java性能采集工具☆51Updated 6 years ago
- EOI数据中台产品☆30Updated 2 years ago
- Doris表和字段血缘项目☆78Updated 10 months ago
- 反应式 海量数据治理平台☆40Updated 4 years ago
- kafka connector 插件,支持输入 mysql binlog 和 json 格式写入ClickHouse。持续更新☆45Updated 4 years ago
- datax web。datax中的web配置界面没有集成在一起开源出来,此为web端配置项目。☆100Updated 6 years ago
- Flink Sql 教程☆34Updated 3 months ago
- DorisDB SQL解析器Java实现;Clickhouse SQL解析器Java实现☆93Updated 2 years ago
- 基于flink 1.8 源码二次开发,详见MD☆82Updated 4 years ago
- kudu可视化工具☆38Updated 4 years ago
- 基于插件架构的数据源服务,统一接口,可操作不同类型数据源☆46Updated 2 years ago
- Flink 案例代码☆43Updated 2 years ago
- 电商用户行为分析大数据平台☆33Updated 5 years ago
- 基于袋鼠云提供的开源flinkStreamSQL项目,对其实时sql进行可视化功能开发;通过tcpip通信,前端页面选择需要连接的数据库信息,并写sql语句,点击提交后,后端自动执行集群启动和JobGraph提交,并返回结果给前端页面。实现了使用者即使不了解Kafka、fl…☆11Updated 5 years ago
- ☆69Updated 2 years ago
- Hive-JDBC-Proxy是一个高性能的HiveServer2和Spark ThriftServer的代理服务,具备负载均衡、基于规则转发Hive JDBC Client的请求给到HiveServer2和Spark ThriftServer的能力。☆32Updated 2 years ago
- Flink 案例开发数据清洗、数据报表☆52Updated 2 years ago