[EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook and HDFS FileBrowser.
☆699Oct 1, 2020Updated 5 years ago
Alternatives and similar repositories for docker-hadoop-spark-workbench
Users that are interested in docker-hadoop-spark-workbench are comparing it to the libraries listed below
Sorting:
- Apache Spark docker image☆2,059Apr 21, 2023Updated 2 years ago
- Apache Hadoop docker image☆2,313Feb 1, 2024Updated 2 years ago
- Spark Notebook docker image☆10Dec 29, 2017Updated 8 years ago
- Zeppelin docker☆16Nov 16, 2020Updated 5 years ago
- Docker build for Apache Spark☆671Dec 30, 2021Updated 4 years ago
- ☆26Nov 22, 2022Updated 3 years ago
- Apache Flink docker image☆197Jul 1, 2022Updated 3 years ago
- ☆1,081Jun 2, 2024Updated last year
- ☆32Mar 7, 2018Updated 8 years ago
- Run Hadoop Custer within Docker Containers☆1,828Jul 1, 2024Updated last year
- Spark + HDFS cluster using docker compose☆48Nov 6, 2018Updated 7 years ago
- Documentation placeholder and utilities for all the other containers.☆29May 1, 2020Updated 5 years ago
- ☆252Nov 15, 2022Updated 3 years ago
- Hadoop docker image☆1,206Jun 25, 2020Updated 5 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆172Feb 4, 2021Updated 5 years ago
- 基于Docker构建的Hadoop开发测试环境,包含Hadoop,Hive,HBase,Spark☆308May 26, 2019Updated 6 years ago
- General README for the Big Data Europe project's sources☆83Sep 24, 2023Updated 2 years ago
- ☆761Mar 11, 2021Updated 4 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆69Feb 3, 2021Updated 5 years ago
- Examples to run Hadoop/Spark clusters locally with docker-compose.☆36Sep 2, 2018Updated 7 years ago
- Docker image for main Apache Hadoop components (Yarn/Hdfs)☆55Dec 10, 2022Updated 3 years ago
- POC for all the stack of big data (kafka, spark, cassandra, hdfs, docker, springboot)☆12Dec 16, 2022Updated 3 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆176May 28, 2025Updated 9 months ago
- Multiple node cluster on Docker for self development.☆92Jul 7, 2018Updated 7 years ago
- A simple spark standalone cluster for your testing environment purposses☆568Mar 6, 2024Updated 2 years ago
- 50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian,…☆1,377Feb 3, 2026Updated last month
- Postgresql configured to work as metastore for Hive.☆32Dec 16, 2022Updated 3 years ago
- Base classes to use when writing tests with Spark☆1,550Dec 22, 2025Updated 2 months ago
- Ansible playbooks to construct distributed computing environments☆62Jun 6, 2021Updated 4 years ago
- Ready-to-run Docker images containing Jupyter applications☆8,424Updated this week
- A Hadoop cluster based on Docker, including Hive and Spark.☆83Nov 13, 2022Updated 3 years ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,362Sep 9, 2025Updated 6 months ago
- hadoop-spark-hive-cluster-docker☆52Nov 3, 2017Updated 8 years ago
- Demo Spark application to transform data gathered on sensors for a heatmap application☆33May 29, 2017Updated 8 years ago
- A docker image for HDFS FileBrowser. Cloudera Hue with FileBrowser only.☆11Sep 20, 2018Updated 7 years ago
- Docker build for Zeppelin, a web-based Spark notebook☆220Nov 30, 2019Updated 6 years ago
- Dockerfile for Apache Kafka☆6,980May 8, 2024Updated last year
- Apache HBase docker image based on alpine☆62Nov 10, 2018Updated 7 years ago
- Example project to show how to use Kafka from Spark Streaming with the Confluent schema registry☆11Aug 17, 2016Updated 9 years ago