s1mplecc / spark-hadoop-docker
☆141Updated 3 years ago
Alternatives and similar repositories for spark-hadoop-docker:
Users that are interested in spark-hadoop-docker are comparing it to the libraries listed below
- A Hadoop cluster based on Docker, including Hive and Spark.☆79Updated 2 years ago
- Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc.☆141Updated 7 months ago
- ☆127Updated 3 years ago
- 基于 PyFlink 的学习文档,通过一个个小实践,便于大家快速入手 PyFlink☆272Updated 3 years ago
- 一个实时数仓项目,从0到1搭建实时数仓☆57Updated 3 years ago
- Docker images for building hadoop3.2, hive 3.1, hbase2.3, presto 0.247, flink1.11.3 on yarn, etc.☆30Updated 2 years ago
- 基于Docker构建的Hadoop开发测试环境,包含Hadoop,Hive,HBase,Spark☆302Updated 5 years ago
- 基于Flink流处理的动态实时亿级全端用户画像系统☆477Updated 2 years ago
- 电商平台数据仓库搭建☆129Updated 2 months ago
- Apache DolphinScheduler Python API, aka PyDolphinscheduler.☆56Updated 3 months ago
- 记录HBase版本API的变迁Demo☆33Updated 5 years ago
- 数据建设与大数据技术知识体系,包含hadoop、hive、spark、flink主流框架和系列框架,数据中台、数据湖、数据治理、数仓建设、数据化转型等☆375Updated last month
- 分享一些在工作中的大数据实战案例,包括flink、kafka、hadoop、presto等等。欢迎大家关注我的公众号【大数据技术与应用实战】,一起成长。☆264Updated last year
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆164Updated 4 years ago
- 一键搭建zookeeper/hadoop/hive/hbase/sqoop/kafka/spark/kylin☆34Updated 5 years ago
- 大数据面试题,从0到1走向架构师之路。Flink、Spark、Hive、HBase、Hadoop、Kettle、Kafka...☆270Updated 4 years ago
- BigData Learning Notes☆50Updated 5 months ago
- 从数据仓库到用户画像,从数据建设到数据应用☆580Updated 3 years ago
- Stock analysis MLOps system based on DolphinScheduler☆12Updated 2 years ago
- flink简易使用教程,结合官方仓库的example样例,结合常见场景,使用flink的基本功能☆113Updated 2 years ago
- flink学习笔记☆386Updated 2 years ago
- Hadoop-Hive-Spark cluster + Jupyter on Docker☆72Updated 3 months ago
- Spark、Flink等离线任务的调度以及实时任务的监控☆299Updated last year
- Using Flink SQL to build ETL job☆203Updated last year
- 大数据环境一键安装脚本☆51Updated 4 years ago
- Asynchronous flink connector based on the Lettuce, supporting sql join and sink, query caching and debugging.☆237Updated last week
- 大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse☆231Updated 4 months ago
- 大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批流,私域营销等模块☆502Updated last week
- 本 GitHub 项目是 Flink Forward Asia Hackathon (2021) 的投票专用项目。☆121Updated 3 years ago
- Flink源码阅读分享,不断记录Flink源码的阅读过程☆90Updated 6 months ago