sequenceiq/hadoop-docker

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sequenceiq/hadoop-docker)

sequenceiq / hadoop-docker

Hadoop docker image

☆1,199

Alternatives and similar repositories for hadoop-docker

Users that are interested in hadoop-docker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sequenceiq / docker-spark
View on GitHub
☆757Mar 11, 2021Updated 5 years ago
kiwenlau / hadoop-cluster-docker
View on GitHub
Run Hadoop Custer within Docker Containers
☆1,819Jul 1, 2024Updated 2 years ago
sequenceiq / docker-hadoop-ubuntu
View on GitHub
A Hadoop image on Ubuntu
☆32Dec 8, 2014Updated 11 years ago
alvinhenrick / hadoop-mutinode
View on GitHub
☆57May 17, 2015Updated 11 years ago
gettyimages / docker-spark
View on GitHub
Docker build for Apache Spark
☆667Dec 30, 2021Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
big-data-europe / docker-hadoop
View on GitHub
Apache Hadoop docker image
☆2,325Feb 1, 2024Updated 2 years ago
big-data-europe / docker-hadoop-spark-workbench
View on GitHub
[EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook a…
☆701Oct 1, 2020Updated 5 years ago
sequenceiq / docker-pam
View on GitHub
☆33Oct 2, 2015Updated 10 years ago
sequenceiq / docker-ambari
View on GitHub
Docker image with Ambari
☆289Nov 21, 2017Updated 8 years ago
Lewuathe / docker-hadoop-cluster
View on GitHub
Multiple node cluster on Docker for self development.
☆91Jul 7, 2018Updated 8 years ago
wurstmeister / kafka-docker
View on GitHub
Dockerfile for Apache Kafka
☆6,965May 8, 2024Updated 2 years ago
big-data-europe / docker-spark
View on GitHub
Apache Spark docker image
☆2,050Apr 20, 2026Updated 3 months ago
bigdatafoundation / docker-hadoop
View on GitHub
Dockerfile for running Hadoop on Ubuntu
☆93Jan 12, 2024Updated 2 years ago
cloudera / hue
View on GitHub
Open source SQL Query Assistant service for Databases/Warehouses
☆1,413Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sequenceiq / docker-spark-native-yarn
View on GitHub
☆13Mar 8, 2018Updated 8 years ago
mesos / hadoop
View on GitHub
Hadoop on Mesos
☆176Oct 4, 2022Updated 3 years ago
spark-notebook / spark-notebook
View on GitHub
Interactive and Reactive Data Science using Scala and Spark.
☆3,142May 16, 2023Updated 3 years ago
apache / gobblin
View on GitHub
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…
☆2,270Jun 24, 2026Updated 3 weeks ago
apache / spark
View on GitHub
Apache Spark - A unified analytics engine for large-scale data processing
☆43,666Updated this week
spark-jobserver / spark-jobserver
View on GitHub
REST job server for Apache Spark
☆2,837Mar 3, 2026Updated 4 months ago
elastic / elasticsearch-hadoop
View on GitHub
Elasticsearch real-time search and analytics natively integrated with Hadoop
☆1,974Updated this week
apache / zeppelin
View on GitHub
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
☆6,645Updated this week
yahoo / CMAK
View on GitHub
CMAK is a tool for managing Apache Kafka clusters
☆11,927Aug 2, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Alluxio / alluxio
View on GitHub
Alluxio, data orchestration for analytics and machine learning in the cloud
☆7,214Apr 29, 2025Updated last year
docker-archive / classicswarm
View on GitHub
Swarm Classic: a container clustering system. Not to be confused with Docker Swarm which is at https://github.com/docker/swarmkit
☆5,731Jun 11, 2020Updated 6 years ago
d2iq-archive / marathon
View on GitHub
Deploy and manage containers (including Docker) on top of Apache Mesos at scale.
☆4,031Sep 8, 2022Updated 3 years ago
dajobe / hbase-docker
View on GitHub
HBase running in Docker
☆333Sep 27, 2022Updated 3 years ago
spotify / docker-kafka
View on GitHub
Kafka (and Zookeeper) in Docker
☆1,384Dec 18, 2019Updated 6 years ago
databricks / spark-avro
View on GitHub
Avro Data Source for Apache Spark
☆537Dec 19, 2018Updated 7 years ago
krejcmat / hadoop-hbase-docker
View on GitHub
HBase on distributed cluster based on Hadoop. Automatized images builds and deploying cluster.
☆74Feb 22, 2018Updated 8 years ago
big-data-europe / docker-hive
View on GitHub
☆1,081Jun 2, 2024Updated 2 years ago
HariSekhon / Dockerfiles
View on GitHub
50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian,…
☆1,380Feb 3, 2026Updated 5 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sequenceiq / periscope
View on GitHub
Periscope brings SLA policy based autoscaling to Hadoop
☆35Jan 25, 2016Updated 10 years ago
tomwhite / hadoop-book
View on GitHub
Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
☆3,500Mar 17, 2020Updated 6 years ago
JerryLead / SparkInternals
View on GitHub
Notes talking about the design and implementation of Apache Spark
☆5,361Apr 2, 2024Updated 2 years ago
apache / druid
View on GitHub
Apache Druid: a high performance real-time analytics database.
☆14,034Updated this week
cloudera / livy
View on GitHub
Livy is an open source REST interface for interacting with Apache Spark from anywhere
☆1,008Oct 5, 2022Updated 3 years ago
analytically / hadoop-ansible
View on GitHub
Ansible playbook that installs a Hadoop cluster, with HBase, Hive, Presto for analytics, and Ganglia, Smokeping, Fluentd, Elasticsearch a…
☆417Sep 22, 2016Updated 9 years ago
jupyter / docker-stacks
View on GitHub
Ready-to-run Docker images containing Jupyter applications
☆8,446Updated this week