big-data-europe / docker-hadoop-spark-workbench
[EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside Docker containers. It also includes spark-notebook and HDFS FileBrowser.
☆698 · Oct 1, 2020 · Updated 5 years ago
Alternatives and similar repositories for docker-hadoop-spark-workbench
Users who are interested in docker-hadoop-spark-workbench are comparing it to the repositories listed below.
- Apache Spark docker image ☆2,058 · Apr 21, 2023 · Updated 2 years ago
- Apache Hadoop docker image ☆2,312 · Feb 1, 2024 · Updated 2 years ago
- Spark Notebook docker image ☆10 · Dec 29, 2017 · Updated 8 years ago
- Zeppelin docker ☆16 · Nov 16, 2020 · Updated 5 years ago
- Docker build for Apache Spark ☆672 · Dec 30, 2021 · Updated 4 years ago
- ☆26 · Nov 22, 2022 · Updated 3 years ago
- Apache Flink docker image ☆197 · Jul 1, 2022 · Updated 3 years ago
- ☆1,080 · Jun 2, 2024 · Updated last year
- Run Hadoop Cluster within Docker Containers ☆1,829 · Jul 1, 2024 · Updated last year
- Spark + HDFS cluster using docker compose ☆48 · Nov 6, 2018 · Updated 7 years ago
- ☆251 · Nov 15, 2022 · Updated 3 years ago
- Hadoop docker image ☆1,208 · Jun 25, 2020 · Updated 5 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file. ☆171 · Feb 4, 2021 · Updated 5 years ago
- A Docker-based Hadoop development and test environment, including Hadoop, Hive, HBase, and Spark ☆307 · May 26, 2019 · Updated 6 years ago
- General README for the Big Data Europe project's sources ☆83 · Sep 24, 2023 · Updated 2 years ago
- ☆761 · Mar 11, 2021 · Updated 4 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark. ☆69 · Feb 3, 2021 · Updated 5 years ago
- Examples to run Hadoop/Spark clusters locally with docker-compose. ☆36 · Sep 2, 2018 · Updated 7 years ago
- Docker image for main Apache Hadoop components (Yarn/Hdfs) ☆56 · Dec 10, 2022 · Updated 3 years ago
- POC for all the stack of big data (kafka, spark, cassandra, hdfs, docker, springboot) ☆12 · Dec 16, 2022 · Updated 3 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR ☆176 · May 28, 2025 · Updated 8 months ago
- Multiple node cluster on Docker for self development. ☆92 · Jul 7, 2018 · Updated 7 years ago
- A simple spark standalone cluster for your testing environment purposes ☆569 · Mar 6, 2024 · Updated last year
- 50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian,… ☆1,376 · Feb 3, 2026 · Updated 2 weeks ago
- Interactive and Reactive Data Science using Scala and Spark. ☆3,151 · May 16, 2023 · Updated 2 years ago
- Base classes to use when writing tests with Spark ☆1,550 · Dec 22, 2025 · Updated last month
- Ready-to-run Docker images containing Jupyter applications ☆8,412 · Feb 8, 2026 · Updated last week
- Ansible playbooks to construct distributed computing environments ☆62 · Jun 6, 2021 · Updated 4 years ago
- A Hadoop cluster based on Docker, including Hive and Spark. ☆83 · Nov 13, 2022 · Updated 3 years ago
- Jupyter magics and kernels for working with remote Spark clusters ☆1,363 · Sep 9, 2025 · Updated 5 months ago
- hadoop-spark-hive-cluster-docker ☆52 · Nov 3, 2017 · Updated 8 years ago
- A docker image for HDFS FileBrowser. Cloudera Hue with FileBrowser only. ☆11 · Sep 20, 2018 · Updated 7 years ago
- Docker build for Zeppelin, a web-based Spark notebook ☆221 · Nov 30, 2019 · Updated 6 years ago
- Dockerfile for Apache Kafka ☆6,981 · May 8, 2024 · Updated last year
- ☆13 · Feb 14, 2016 · Updated 10 years ago
- Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc. ☆150 · Sep 23, 2024 · Updated last year
- Docker Apache Airflow ☆3,814 · Mar 1, 2023 · Updated 2 years ago
- A dockerized small bigdata cluster to play with ☆13 · Jun 14, 2016 · Updated 9 years ago
- Docker images to run cloudera cluster ☆12 · May 16, 2018 · Updated 7 years ago