hulichao / docker-bigdata
☆25 Updated last year
Alternatives and similar repositories for docker-bigdata
Users that are interested in docker-bigdata are comparing it to the libraries listed below
- Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc. ☆145 Updated 9 months ago
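- A simple, easy-to-use ETL tool ☆17 Updated 6 years ago
- A reactive governance platform for massive data ☆41 Updated 4 years ago
- A DataX-based scheduling tool for data synchronization tasks; supports custom scheduled tasks, crontab expressions, and adding custom DataX synchronization tasks ☆39 Updated 6 years ago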
- ☆56 Updated 2 years ago
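- This project mainly serves as the data bus of a data middle platform or data platform. It supports direct, real-time monitoring of data changes in databases such as MySQL, MongoDB, PostgreSQL, Oracle, SQL Server, Db2, and Cassandra. ☆62 Updated last year
- DataX distributed clustering and load balancing, with task execution and statistics; a general-purpose DataX-based data synchronization microservice that handles all general data synchronization through a single RESTful interface ☆43 Updated 4 years ago
- Kafka connector plugin that accepts MySQL binlog and JSON input and writes to ClickHouse. Continuously updated. ☆45 Updated 4 years ago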
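- Hive-JDBC-Proxy is a high-performance proxy service for HiveServer2 and Spark ThriftServer. It provides load balancing and rule-based forwarding of Hive JDBC client requests to HiveServer2 and Spark ThriftServer. ☆32 Updated 3 years ago
- CDH installation manual ☆86 Updated 2 years ago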
- Learning Flink: Flink CEP, Flink Core, Flink SQL ☆72 Updated 3 years ago
- Apache Kylin service integration for Ambari (offline deployment; supports HDP 2.6+ and HDP 3.0+) ☆37 Updated 4 years ago
- A distributed data factory, providing data access, ETL, scheduling. Easily manage tasks such as hive, spark, clickhouse, flink, shell, py… ☆32 Updated 3 years ago
- Docker images for building hadoop3.2, hive 3.1, hbase2.3, presto 0.247, flink1.11.3 on yarn, etc. ☆31 Updated 2 years ago
- Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to cons… ☆79 Updated last year
- Java implementation of a DorisDB SQL parser; Java implementation of a ClickHouse SQL parser ☆95 Updated 2 years ago
- ☆38 Updated last year
- Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-an… ☆107 Updated 2 months ago
- flink endpoint for open world ☆27 Updated last month
- Demos recording how the HBase API has changed across versions ☆33 Updated 6 years ago
- Zeus is an open-source analytical engine for big data held in a data lake; it was designed to provide OLAP (Online Analytical Processing) … ☆25 Updated 3 years ago
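- DataX distributed clustering, custom DataX plugins, task monitoring via source-code modifications, and a hook for storing dirty data in tables ☆26 Updated 4 years ago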
- ☆28 Updated 3 years ago
- ☆62 Updated 2 weeks ago
- Apache StreamPark quickstart ☆72 Updated 4 months ago
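- Built on the open-source flinkStreamSQL project from DTStack (袋鼠云), this project adds visualization features for its real-time SQL. Communicating over TCP/IP, the front-end page lets the user select the database to connect to and write SQL statements; after the user clicks submit, the back end automatically starts the cluster, submits the JobGraph, and returns the results to the front-end page. This lets users, even those unfamiliar with Kafka, fl… ☆11 Updated 6 years ago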
- EOI data middle-platform product ☆31 Updated 2 years ago
- Kudu visualization tool ☆38 Updated 5 years ago
- Data quality check tool that works by executing SQL ☆21 Updated 7 years ago
- Guardian of Waterdrop and Spark ☆30 Updated 2 years ago