hulichao / docker-bigdata
☆25Updated last year
Related projects ⓘ
Alternatives and complementary repositories for docker-bigdata
- Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc.☆136Updated last month
- 反应式 海量数据治理平台☆38Updated 4 years ago
- 简单易用的ETL工具☆17Updated 5 years ago
- Learning Flink : Flink CEP,Flink Core,Flink SQL☆71Updated 2 years ago
- DataX分布式集群与负载均衡、任务执行/统计,基于DataX的通用数据同步微服务,一个Restful接口搞定所有通用数据同步☆42Updated 3 years ago
- ☆18Updated 3 years ago
- 数据治理、数据标准相关的 web 工具☆35Updated 2 years ago
- 此项目主要应用于数据中台或数据平台的数据总线,支持直接实时监听MySQL、MongoDB、PostgreSQL、Oracle、SQL Server、Db2和Cassandra等数据库的数据变更。☆60Updated 11 months ago
- ☆57Updated last year
- tis console web ui dashboard☆13Updated 2 weeks ago
- 记录HBase版本API的变迁Demo☆33Updated 5 years ago
- 大数据自动化部署,包括自动化部署hadoop、hive、hbase、spark、storm等等一系列组件☆66Updated 6 years ago
- ☆14Updated 2 years ago
- Docker images for building hadoop3.2, hive 3.1, hbase2.3, presto 0.247, flink1.11.3 on yarn, etc.☆31Updated last year
- 超实用的hive表数据、分区,hdfs文件的自动化清理工具☆20Updated 2 years ago
- Hive-JDBC-Proxy是一个高性能的HiveServer2和Spark ThriftServer的代理服务,具备负载均衡、基于规则转发Hive JDBC Client的请求给到HiveServer2和Spark ThriftServer的能力。☆31Updated 2 years ago
- 基于DataX的数据同步任务调度工具,支持自定义定时任务,支持crontab表达式,支持自定义添加DataX数据同步任务☆39Updated 5 years ago
- ☆63Updated this week
- Apache StreamPark quickstart☆67Updated 4 months ago
- Flink 案例开发数据清洗、数据报表☆52Updated 2 years ago
- 基于Flink+ClickHouse实时计算平台☆30Updated 2 years ago
- HiveReader for alibaba DataX☆18Updated last year
- ☆46Updated last year
- Apache Hudi Demo☆21Updated 4 months ago
- ☆38Updated last year
- kafka connector 插件,支持输入 mysql binlog 和 json 格式写入ClickHouse。持续更新☆45Updated 4 years ago
- 数据质量控制系统☆44Updated 3 years ago
- 数仓项目☆10Updated 5 years ago
- A distributed data factory, providing data access, etl, scheduling. Easily manage tasks such as hive, spark, clickhouse, flink, shell, py…☆32Updated 2 years ago