enkhalifapro / bigdata-all-in-oneLinks
Docker-compose contains the most common big data systems like: Apache Hadoop, Apache Hive, Apache Spark, Jupyter, Flink
☆27Updated last year
Alternatives and similar repositories for bigdata-all-in-one
Users that are interested in bigdata-all-in-one are comparing it to the libraries listed below
Sorting:
- Hadoop-Hive-Spark cluster + Jupyter on Docker☆74Updated 5 months ago
- 基于SparkSQL的电影分析项目实战☆40Updated 4 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆69Updated 4 years ago
- 一键搭建zookeeper/hadoop/hive/hbase/sqoop/kafka/spark/kylin☆34Updated 5 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆165Updated 4 years ago
- Spark Streaming + kafka + hbase☆15Updated 6 years ago
- Infrastructure automation to deploy Hadoop,Hive,Spark,airflow nodes on a docker host☆20Updated 6 years ago
- Docker images for building hadoop3.2, hive 3.1, hbase2.3, presto 0.247, flink1.11.3 on yarn, etc.☆30Updated 2 years ago
- spark自学手册,包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。☆18Updated last year
- 《Spark: The Definitive Guide Big Data Processing Made Simple》学习心得,说翻译嘛也不算完全翻译吧,只能说以个人经验和理解重新叙述一遍。同步更新在掘金上,点链接可跳转☆34Updated 5 years ago
- 自助搭建的 hadoop + spark + kafka + zookeeper + storm + hbase + hive + flume 集群,一主两从。☆30Updated 6 years ago
- SparkStreaming项目,显示flume->Kafka->Spark->hbase(实时数据处理方案),Scala实现☆36Updated 7 years ago
- 基于 Spark Streaming + ALS 的餐饮推荐系统☆88Updated 6 years ago
- 基于SparkMLLib实现的商品推荐功能,包括:基于用户的协同过滤,基于物品的协同过滤,基于ALS交替最小二乘的协同过滤。☆36Updated 6 years ago
- Spark中机器学习算法包使用案例☆9Updated 7 years ago
- Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc.☆142Updated 8 months ago
- spark将hdfs数据高性能灌入kafka,然后spark streaming/structured streaming高速消费,关注性能,欢迎提供性能/代码优化建议☆33Updated 6 years ago
- docker-hadoop-spark-hive 快速构建你的大数据环境☆22Updated 5 years ago
- 基于flink的用户行为分析☆51Updated last year
- 简单易用的ETL工具☆17Updated 6 years ago
- Spark中实现用户画像系统价值度、忠诚度、流失预警、活跃度等模型☆66Updated 8 years ago
- 大数据框架 Spark MLlib 机器学习库基础算法全面讲解,附带齐全的测试文件☆40Updated last year
- Docker Big Data Tools: This docker-compose file is configured to run multiple nodes. This is a Hadoop Cluster that contains the necessary…☆30Updated 3 years ago
- 使用spark streaming 导入kafka数据到hbase☆25Updated 9 years ago
- 机器学习项目☆38Updated 8 years ago
- 基于spark-ml,spark-mllib,spark-streaming的推荐算法实现☆96Updated 6 years ago
- 《Spark 快速大数据分析》学习笔记☆43Updated last year
- Learning Flink : Flink CEP,Flink Core,Flink SQL☆71Updated 3 years ago
- docker构建大数据开发学习环境☆50Updated 8 years ago
- Spark机器学习书代码☆25Updated 7 years ago