A simple spark standalone cluster for your testing environment purposses
☆568Mar 6, 2024Updated 2 years ago
Alternatives and similar repositories for docker-spark-cluster
Users that are interested in docker-spark-cluster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Apache Spark docker image☆2,052Apr 21, 2023Updated 2 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆508Nov 7, 2025Updated 5 months ago
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines☆134Nov 4, 2022Updated 3 years ago
- Apache Spark docker container image (Standalone mode)☆35Oct 16, 2020Updated 5 years ago
- One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)☆120Jul 20, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆38Mar 29, 2021Updated 5 years ago
- [EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook a…☆700Oct 1, 2020Updated 5 years ago
- Docker build for Apache Spark☆669Dec 30, 2021Updated 4 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆171Feb 4, 2021Updated 5 years ago
- ☆25Mar 15, 2024Updated 2 years ago
- Spark cluster in docker containers with sample training Jupyter notebooks☆27Feb 24, 2023Updated 3 years ago
- Spark + HDFS cluster using docker compose☆48Nov 6, 2018Updated 7 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆69Feb 3, 2021Updated 5 years ago
- spark on kubernetes☆104Feb 20, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Spark and Hive docker containers sharing a common MySQL metastore☆26Apr 17, 2020Updated 6 years ago
- Official Dockerfile for Apache Spark☆166Feb 18, 2026Updated 2 months ago
- Docker Apache Airflow☆3,807Mar 1, 2023Updated 3 years ago
- Docker multi-nodes Hadoop cluster with Spark 2.4.1 on Yarn☆51Dec 7, 2020Updated 5 years ago
- Predicting the Remaining Useful Life (RUL) of simulated Turbofan Engines using Spark ML, Spark Structured Streaming, and Kafka.☆26Oct 15, 2024Updated last year
- java语言系统性刷过的算法题☆16Apr 25, 2025Updated 11 months ago
- Apache Hadoop docker image☆2,318Feb 1, 2024Updated 2 years ago
- Spark on Kubernetes infrastructure Helm charts repo☆202Oct 20, 2022Updated 3 years ago
- ☆1,078Jun 2, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Deploy your Spark Production Cluster on Kubernetes☆46Sep 13, 2020Updated 5 years ago
- Multi-container environment with Hadoop, Spark and Hive☆233May 5, 2025Updated 11 months ago
- A Spark cluster setup running on Docker containers☆61Dec 26, 2019Updated 6 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,753Updated this week
- Spark on Kubernetes samples☆20Jun 8, 2021Updated 4 years ago
- An Efficient MoE by Orchestrating Atomic Experts at Scale☆107Mar 26, 2026Updated 3 weeks ago
- CalData infrastructure☆24Apr 7, 2026Updated last week
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆102Jan 31, 2023Updated 3 years ago
- Presenting 3 ways to run Spark over containers, this project is recommended to those who seek to explore Big Data out of a Hadoop Cluster…☆11Nov 25, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The official repository for the Rock the JVM Spark Essentials with Scala course☆277Mar 14, 2026Updated last month
- Apache Spark - A unified analytics engine for large-scale data processing☆43,144Updated this week
- Apache Airflow in Docker Compose (for both versions 1.10.* and 2.*)☆184Nov 23, 2023Updated 2 years ago
- Set up a 3 node spark cluster using docker containers☆34Mar 23, 2018Updated 8 years ago
- The Internals of Spark SQL☆488Jan 25, 2026Updated 2 months ago
- How to setup a minimal Hadoop cluster using Docker☆11Mar 13, 2022Updated 4 years ago
- Lecture: Big Data☆14Oct 27, 2025Updated 5 months ago