spancer/bigdata-docker-builds

spancer / bigdata-docker-builds

Docker images for building Hadoop 3.2, Hive 3.1, HBase 2.3, Presto 0.247, Flink 1.11.3 on YARN, etc.

☆32

Alternatives and similar repositories for bigdata-docker-builds

Users who are interested in bigdata-docker-builds are comparing it to the repositories listed below.

spancer / bigdata-docker-compose
Deploy a big data platform using Docker Compose. Big data components include Hadoop, Hive, HBase, Presto, Flink, ES, Kafka, etc.
☆150 · Sep 23, 2024 · Updated last year
lschampion / bigdata-docker-compose
Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.
☆10 · Apr 30, 2022 · Updated 4 years ago
thestyleofme / spark-gmall-parent
Real-time data warehouse based on Spark Streaming
☆12 · Jul 23, 2023 · Updated 2 years ago
Huac233 / MLSub
Subscription conversion that adds a zero-rated (free-data) host
☆11 · Feb 16, 2023 · Updated 3 years ago
xiongmozhou / gmall-realtime
Real-time data warehouse project
☆10 · May 9, 2019 · Updated 7 years ago
cmdviegas / hadoop-spark
This is a script to deploy a cluster with Apache Hadoop and Apache Spark + Apache Hive in distributed mode using Docker as infrastructure…
☆26 · Feb 25, 2026 · Updated 4 months ago
spancer / zeus
Zeus is an open-source analytical engine for big data held in a data lake; it was designed to provide OLAP (Online Analytical Processing) …
☆27 · Nov 2, 2021 · Updated 4 years ago
MuziMin0222 / AnalysisOfUserBehaviors
E-commerce user behavior analysis system based on Spark
☆17 · Mar 23, 2023 · Updated 3 years ago
fabiogjardim / bigdata_docker
Big Data Ecosystem Docker
☆428 · Apr 29, 2023 · Updated 3 years ago
maguichang / DataQuality
Data governance -> data quality
☆12 · Jun 5, 2019 · Updated 7 years ago
liaxiufeng / cloudbox
Cloud drive front end (Vue)
☆12 · May 28, 2022 · Updated 4 years ago
hrchlhck / k8s-bigdata
Apache Spark with HDFS cluster within Kubernetes
☆12 · Jul 11, 2023 · Updated 3 years ago
panovvv / bigdata-docker-compose
Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.
☆168 · Feb 4, 2021 · Updated 5 years ago
tophua / spark1.52
Chinese annotations of the Spark source code
☆42 · Aug 22, 2018 · Updated 7 years ago
coderblack / doe-data
Learn big data with Duoyi Education (多易教育)
☆29 · Sep 30, 2022 · Updated 3 years ago
wangAoqi666 / bigdata-interview
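The most complete big data interview guide for major tech companies: big data interview questions, big data interviews, Wang Aoqi's road to big data, the road to big data mastery, Flink/Spark/Hadoop/Hbase/Hive/Impala/Hbase/MapReduce/YARN/HDFS/Kafka/Flume/Linux/Java/Scala..…
☆63 · Dec 6, 2021 · Updated 4 years ago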
wgzhao / presto-clickhouse
ClickHouse connector both for PrestoSQL and Trino
☆18 · Mar 3, 2021 · Updated 5 years ago
darlinglele / relational
Association analysis: frequent itemsets and association rules
☆12 · Jul 31, 2014 · Updated 11 years ago
datyrlab / python-pyspark-framework
pyspark framework
☆25 · Feb 22, 2022 · Updated 4 years ago
Teanix / CloudRestaurant
Hands-on Go project: an online restaurant
☆10 · Jan 5, 2022 · Updated 4 years ago
gitlbo / hive
Apache Hive
☆13 · Jan 3, 2021 · Updated 5 years ago
mayi295940 / streampark-piflowx
☆17 · Jul 9, 2025 · Updated last year
yszhdhy / generator
☆14 · May 10, 2024 · Updated 2 years ago
niunaruto / NNTimerLoop
A demo that wraps a countdown timer feature
☆10 · Feb 17, 2017 · Updated 9 years ago
baifendian / RedisSentinelClient
Client for Sentinel-based Redis clusters; supports automatic master-replica failover and uses ketama as its consistent hashing algorithm
☆28 · Jun 9, 2017 · Updated 9 years ago
NingningLi / data-cleaning
Data cleaning system; Hadoop; entity recognition; conflict resolution; inconsistency repair; missing-value imputation
☆18 · Apr 28, 2016 · Updated 10 years ago
n0vad3v / simple-multinode-clickhouse-cluster
Deploy a simple Multi-Node Clickhouse Cluster with docker-compose in minutes.
☆17 · Feb 11, 2022 · Updated 4 years ago
channels-frontend / django-channels
A convenience library to handle ASGI messages over websockets
☆19 · Jan 9, 2023 · Updated 3 years ago
unwiredlabs / locationapi-client-libraries
Contains the OpenAPI Specification (v3) for LocationAPI and client libraries generated by the openapi-generator https://openapi-generator…
☆11 · Jul 5, 2023 · Updated 3 years ago
BlackBoxVision / mui-audio-player
🚀 Material-UI based Audio Player
☆10 · Dec 2, 2022 · Updated 3 years ago
slimina / ip-line-analysis
Analyzes the carrier lines of IP addresses in China (China Unicom, China Telecom, China Mobile, etc.) based on APNIC data
☆10 · Aug 16, 2017 · Updated 8 years ago
ankushs92 / Spring-Boot-DB-IP
☆13 · May 23, 2018 · Updated 8 years ago
askmrsinh / big-data-tools-install
Scripts for installing Hadoop, HBase, Hive, Pig & Spark.
☆10 · Nov 13, 2019 · Updated 6 years ago
collabH / flink-learn
Flink learning
☆12 · Jul 12, 2024 · Updated 2 years ago
zhaomengit / raft-comment
Annotations of the Raft algorithm implementation in etcd
☆11 · Jun 6, 2017 · Updated 9 years ago
thegodofwar / MR_HBase
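Examples of using Map/Reduce in Hadoop: input (DBInputFormat) and output (DBOutputFormat) as MySQL database tables, log analysis with Grep, word sorting with Sort... Basic HBase operations (create, delete, query, update), and bulk-importing data into HBase tables with Map/Reduce..…
☆14 · Apr 6, 2013 · Updated 13 years ago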
chengkenli / StarRocksDevops
StarRocks operations and maintenance tools
☆17 · Sep 5, 2025 · Updated 10 months ago
helloworlde / k8s-spring-boot-demo
Kubernetes with Spring Boot demo
☆10 · Feb 20, 2019 · Updated 7 years ago
pranav1699 / flink-iceberg-minio-trino
This project demonstrates Real-Time streaming of CDC data from MySql to Apache Iceberg using Flink SQL Client for faster data analytics a…
☆25 · Jan 16, 2024 · Updated 2 years ago