enkhalifapro / bigdata-all-in-one
A Docker Compose setup containing the most common big data systems: Apache Hadoop, Apache Hive, Apache Spark, Jupyter, and Flink
☆27 Updated 2 years ago
Alternatives and similar repositories for bigdata-all-in-one
Users who are interested in bigdata-all-in-one are comparing it to the repositories listed below
- Hadoop-Hive-Spark cluster + Jupyter on Docker ☆80 Updated 9 months ago
- One-click setup of zookeeper/hadoop/hive/hbase/sqoop/kafka/spark/kylin ☆34 Updated 5 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark. ☆70 Updated 4 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file. ☆169 Updated 4 years ago
- Hands-on movie analysis project based on SparkSQL ☆40 Updated 4 years ago
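- k8s hadoop: quickly set up a hadoop/hbase/hive environment on k8s. An old project for personal use, picked up again for Tencent TBDS training as the base (plus kafka/flink) for a practice environment ☆22 Updated 4 years ago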
- Spark self-study handbook covering spark core, spark sql, spark streaming, spark-kafka, delta-lake, and basic Scala exercises, plus source-code analysis (e.g., master, shuffle), summaries, and translations. ☆18 Updated 2 years ago
- SparkStreaming project demonstrating flume->Kafka->Spark->hbase (a real-time data processing solution), implemented in Scala ☆36 Updated 7 years ago
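- Source code mainly for learning: 1. Spring Boot+Hadoop+Hive+Hbase for basic data operations; the Hive data source uses Alibaba DruidDataSource, with JDBCTemplate for data operations; Hbase uses hbase-client for data operations; API visualization… ☆22 Updated 4 years ago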
- docker-hadoop-spark-hive: quickly build your big data environment ☆21 Updated 5 years ago
- A Docker setup using Airflow with the Hadoop ecosystem (hive, spark, and sqoop) ☆12 Updated 4 years ago
- Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc. ☆148 Updated last year
- Spark learning path: study notes covering Spark Core, Spark SQL, Spark Streaming, and Spark mllib ☆145 Updated 7 years ago
- A Hadoop cluster based on Docker, including Hive and Spark. ☆81 Updated 2 years ago
- Docker images for building hadoop3.2, hive 3.1, hbase2.3, presto 0.247, flink1.11.3 on yarn, etc. ☆32 Updated 2 years ago
- User behavior analysis based on flink ☆50 Updated 2 years ago
- Self-built hadoop + spark + kafka + zookeeper + storm + hbase + hive + flume cluster, with one master and two worker nodes. ☆30 Updated 6 years ago
- Build a big data development and learning environment with docker ☆50 Updated 8 years ago
- POC for the full big data stack (kafka, spark, cassandra, hdfs, docker, springboot) ☆12 Updated 2 years ago
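- Product recommendation implemented with SparkMLLib, including user-based collaborative filtering, item-based collaborative filtering, and ALS (alternating least squares) collaborative filtering. ☆37 Updated 6 years ago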
- Interview questions on java, big data, spark, flink, redis, hive, hbase, and kafka; data structures, algorithms, design patterns ☆22 Updated 4 years ago
- Kafka, Spark Streaming, Kudu integration examples ☆17 Updated 7 years ago
- A HBase datasource implementation for Spark and [MLSQL](http://www.mlsql.tech). ☆15 Updated 2 years ago
- Apache Spark structured streaming connector for Yandex ClickHouse OLAP ☆16 Updated 8 years ago
- Import kafka data into hbase using spark streaming ☆25 Updated 9 years ago
- ☆14 Updated 8 years ago
- flink iceberg integration tests, jobs running on yarn. ☆38 Updated 4 years ago
- Complete Spark example code demos (java, scala) ☆85 Updated 5 years ago
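- Based on Spark SQL; lets you operate on HBase tables by entering SQL statements. Currently supports querying, creating, and dropping HBase tables, as well as inserting data (you must specify the rowKey generation rule yourself); data deletion and distributed import of large-scale data are under development ☆13 Updated last year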
- Machine learning project ☆38 Updated 8 years ago