enkhalifapro / bigdata-all-in-one
A Docker Compose setup containing the most common big data systems, such as Apache Hadoop, Apache Hive, Apache Spark, Jupyter, and Flink
☆27 Updated last year
Alternatives and similar repositories for bigdata-all-in-one
Users who are interested in bigdata-all-in-one are comparing it to the repositories listed below.
- Hadoop-Hive-Spark cluster + Jupyter on Docker ☆77 Updated 7 months ago
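- A hands-on movie analysis project based on SparkSQL ☆40 Updated 4 years ago
- Spark self-study handbook covering spark core, spark sql, spark streaming, spark-kafka, and delta-lake, plus basic scala exercises and source-code analyses (e.g. master, shuffle), summaries, and translations. ☆18 Updated 2 years ago
- One-click setup of zookeeper/hadoop/hive/hbase/sqoop/kafka/spark/kylin ☆34 Updated 5 years ago
- SparkStreaming project demonstrating flume->Kafka->Spark->hbase (a real-time data processing solution), implemented in Scala ☆36 Updated 7 years ago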
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file. ☆167 Updated 4 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark. ☆70 Updated 4 years ago
- User behavior analysis based on flink ☆51 Updated last year
- An HBase datasource implementation for Spark and [MLSQL](http://www.mlsql.tech). ☆15 Updated last year
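- k8s hadoop: quickly set up a hadoop/hbase/hive environment on k8s. An old project originally for personal use, picked up again to build a practice environment on top of it (with kafka/flink added) for Tencent tbds training ☆22 Updated 4 years ago
- Source code mainly for learning: 1. Basic data operations with Spring Boot+Hadoop+Hive+Hbase; the Hive data source uses Alibaba DruidDataSource, with JDBCTemplate for data access; Hbase uses hbase-client for data operations; API visualization… ☆22 Updated 4 years ago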
- Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc. ☆146 Updated 10 months ago
- Machine learning projects ☆38 Updated 8 years ago
- Reference code related to user profiling ☆157 Updated 3 years ago
- Offline analysis project for user session behavior data (四川大学拓思爱诺) ☆67 Updated 3 years ago
- POC for all the stack of big data (kafka, spark, cassandra, hdfs, docker, springboot) ☆12 Updated 2 years ago
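- The most complete big data interview guide for major tech companies: big data interview questions, big data interviews, 王傲旗's road to big data, the road to big data mastery, Flink/Spark/Hadoop/Hbase/Hive/Impala/Hbase/MapReduce/YARN/HDFS/Kafka/Flume/Linux/Java/Scala..… ☆61 Updated 3 years ago
- The road to learning Spark: study notes on Spark Core, Spark SQL, Spark Streaming, and Spark mllib ☆145 Updated 7 years ago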
- Life-cycle: Internal working of HDFS, SQOOP, HIVE, SPARK, HBASE, KAFKA with code. ☆15 Updated 5 years ago
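- Examples of using Map/Reduce in hadoop: input (DBInputFormat) and output (DBOutputFormat) as MySql database tables, log analysis (Grep), word sorting (Sort)... basic HBase operations (create, delete, query, update), bulk-importing data into HBase tables with Map/Reduce..… ☆14 Updated 12 years ago
- This project is composed of two parts: 1) write: a spark job that writes to hbase using bulk load; 2) read: a rest api reading from hbase based on … ☆20 Updated 7 years ago
- Spark streaming reads messages from kafka and writes the offsets to Redis; spark computes word frequencies and finally writes them to a hive table ☆17 Updated 6 years ago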
- Spark Streaming + kafka + hbase ☆15 Updated 6 years ago
- Flink processes data arriving from kafka in real time and writes it to hbase using connection pooling ☆96 Updated 3 years ago
- A Docker setup using Airflow with the Hadoop ecosystem (hive, spark, and sqoop) ☆12 Updated 4 years ago
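- Spark source code with Chinese annotations ☆42 Updated 6 years ago
- Product recommendation implemented with SparkMLLib, including user-based collaborative filtering, item-based collaborative filtering, and collaborative filtering based on ALS (alternating least squares). ☆37 Updated 6 years ago
- A comprehensive walkthrough of the basic algorithms in Spark MLlib, the machine learning library of the Spark big data framework, with a complete set of test files ☆40 Updated last year
- Spark machine learning: uses jupyter to explain algorithm principles and run related examples ☆105 Updated 8 years ago
- Spark loads hdfs data into kafka at high performance, then spark streaming/structured streaming consumes it at high speed; focused on performance, and suggestions for performance/code optimization are welcome ☆33 Updated 6 years ago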