big-data-europe/docker-hadoop

Apache Hadoop docker image

☆2,327

Alternatives and similar repositories for docker-hadoop

Users that are interested in docker-hadoop are comparing it to the libraries listed below.

big-data-europe / docker-hive
☆1,081 • Jun 2, 2024 • Updated 2 years ago
big-data-europe / docker-spark
Apache Spark docker image
☆2,050 • Apr 20, 2026 • Updated 3 months ago
big-data-europe / docker-hbase
☆250 • Nov 15, 2022 • Updated 3 years ago
big-data-europe / docker-hadoop-spark-workbench
[EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook a…
☆701 • Oct 1, 2020 • Updated 5 years ago
big-data-europe / docker-flink
Apache Flink docker image
☆196 • Jul 1, 2022 • Updated 4 years ago
kiwenlau / hadoop-cluster-docker
Run Hadoop Cluster within Docker Containers
☆1,819 • Jul 1, 2024 • Updated 2 years ago
big-data-europe / docker-kafka
☆31 • Mar 7, 2018 • Updated 8 years ago
big-data-europe / docker-zeppelin
☆26 • Nov 22, 2022 • Updated 3 years ago
big-data-europe / docker-hive-metastore-postgresql
PostgreSQL configured to work as a metastore for Hive.
☆32 • Dec 16, 2022 • Updated 3 years ago
sequenceiq / hadoop-docker
Hadoop docker image
☆1,197 • Jun 25, 2020 • Updated 6 years ago
bambrow / docker-hadoop-workbench
A Hadoop cluster based on Docker, including Hive and Spark.
☆83 • Nov 13, 2022 • Updated 3 years ago
ruoyu-chen / hadoop-docker
A Hadoop development and test environment built on Docker, including Hadoop, Hive, HBase, and Spark
☆307 • May 26, 2019 • Updated 7 years ago
spancer / bigdata-docker-compose
Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc.
☆150 • Sep 23, 2024 • Updated last year
HariSekhon / Dockerfiles
50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian,…
☆1,380 • Feb 3, 2026 • Updated 5 months ago
Marcel-Jan / docker-hadoop-spark
Multi-container environment with Hadoop, Spark and Hive
☆235 • May 5, 2025 • Updated last year
flokkr / docker-hadoop
Docker image for main Apache Hadoop components (Yarn/Hdfs)
☆56 • Dec 10, 2022 • Updated 3 years ago
wurstmeister / kafka-docker
Dockerfile for Apache Kafka
☆6,963 • May 8, 2024 • Updated 2 years ago
panovvv / bigdata-docker-compose
Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.
☆168 • Feb 4, 2021 • Updated 5 years ago
msemn / bd-infra
☆90 • Sep 24, 2022 • Updated 3 years ago
GezimSejdiu / flink-starter
Apache Flink demo example
☆17 • Jan 10, 2019 • Updated 7 years ago
apache / hudi
Upserts, Deletes And Incremental Processing on Big Data.
☆6,197 • Updated this week
flokkr / runtime-compose
Examples to run Hadoop/Spark clusters locally with docker-compose.
☆36 • Sep 2, 2018 • Updated 7 years ago
big-data-europe / docker-hdfs-filebrowser
A docker image for HDFS FileBrowser. Cloudera Hue with FileBrowser only.
☆11 • Sep 20, 2018 • Updated 7 years ago
panovvv / hadoop-hive-spark-docker
Base Docker image with just essentials: Hadoop, Hive and Spark.
☆67 • Feb 3, 2021 • Updated 5 years ago
elek / flokkr-runtime-kubernetes
Examples to run Hadoop/Spark cluster with kubernetes.
☆12 • Feb 10, 2019 • Updated 7 years ago
big-data-europe / README
General README for the Big Data Europe project's sources
☆84 • Sep 24, 2023 • Updated 2 years ago
apache / hadoop
Apache Hadoop
☆15,618 • Updated this week
apache / spark
Apache Spark - A unified analytics engine for large-scale data processing
☆43,716 • Updated this week
apache / flink
Apache Flink
☆26,222 • Updated this week
apache / dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
☆14,401 • Updated this week
bigdatafoundation / docker-hadoop
Dockerfile for running Hadoop on Ubuntu
☆93 • Jan 12, 2024 • Updated 2 years ago
cloudera / hue
Open source SQL Query Assistant service for Databases/Warehouses
☆1,415 • Updated this week
Lewuathe / docker-hadoop-cluster
Multiple node cluster on Docker for self development.
☆91 • Jul 7, 2018 • Updated 8 years ago
puckel / docker-airflow
Docker Apache Airflow
☆3,810 • Mar 1, 2023 • Updated 3 years ago
apache / seatunnel
SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.
☆9,511 • Updated this week
datahub-project / datahub
The Context Platform for your Data and AI Stack
☆12,365 • Updated this week
apache / flink-cdc
Flink CDC is a streaming data integration tool
☆6,450 • Updated this week
prestodb / presto
The official home of the Presto distributed SQL query engine for big data
☆16,719 • Updated this week
DataLinkDC / dinky
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
☆3,745 • Updated this week