mvillarrealb / docker-spark-cluster
A simple Spark standalone cluster for your testing environment purposes
☆569Mar 6, 2024Updated last year
Alternatives and similar repositories for docker-spark-cluster
Users that are interested in docker-spark-cluster are comparing it to the libraries listed below
- Apache Spark docker image☆2,058Apr 21, 2023Updated 2 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆507Nov 7, 2025Updated 3 months ago
- One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)☆120Jul 20, 2021Updated 4 years ago
- Docker with Airflow and Spark standalone cluster☆262Aug 5, 2023Updated 2 years ago
- Docker build for Apache Spark☆672Dec 30, 2021Updated 4 years ago
- [EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook a…☆698Oct 1, 2020Updated 5 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆171Feb 4, 2021Updated 5 years ago
- Spark + HDFS cluster using docker compose☆48Nov 6, 2018Updated 7 years ago
- spark on kubernetes☆104Feb 20, 2023Updated 2 years ago
- A Spark cluster setup running on Docker containers☆61Dec 26, 2019Updated 6 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆69Feb 3, 2021Updated 5 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆38Mar 29, 2021Updated 4 years ago
- Algorithm problems systematically worked through in Java☆16Apr 25, 2025Updated 9 months ago
- ☆1,080Jun 2, 2024Updated last year
- ☆25Mar 15, 2024Updated last year
- Docker multi-nodes Hadoop cluster with Spark 2.4.1 on Yarn☆51Dec 7, 2020Updated 5 years ago
- Docker Apache Airflow☆3,814Mar 1, 2023Updated 2 years ago
- Apache Hadoop docker image☆2,312Feb 1, 2024Updated 2 years ago
- Spark cluster in docker containers with sample training Jupyter notebooks☆27Feb 24, 2023Updated 2 years ago
- An Efficient MoE by Orchestrating Atomic Experts at Scale☆99Feb 8, 2026Updated last week
- Spark on Kubernetes infrastructure Helm charts repo☆203Oct 20, 2022Updated 3 years ago
- [易车] Demos for Spark, Flink, HBase, Hive, and Flume, along with some demos of native Hadoop APIs (HDFS and MapReduce, currently only these two); also tests some exception behavior☆16Apr 4, 2019Updated 6 years ago
- Official Dockerfile for Apache Spark☆165Feb 5, 2026Updated last week
- Deploy your Spark Production Cluster on Kubernetes☆46Sep 13, 2020Updated 5 years ago
- A minimal docker compose setup for experimenting with cloud agnostic Lakehouse Architectures Apache Spark with Hive Metastore + Delta Lak…☆34Apr 17, 2024Updated last year
- Kafka streaming with Spark and Flink example☆31Jul 16, 2023Updated 2 years ago
- pyspark methods to enhance developer productivity 📣 👯 🎉☆682Mar 6, 2025Updated 11 months ago
- Demo on how to integrate Spring Data JPA, Apache Spark and GraphX with Java and Scala mixed codes☆19May 14, 2018Updated 7 years ago
- Flowchart for debugging Spark applications☆106Sep 25, 2024Updated last year
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,575Feb 10, 2026Updated last week
- CalData infrastructure☆22Updated this week
- The official repository for the Rock the JVM Spark Essentials with Scala course☆278Sep 10, 2025Updated 5 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆103Jan 31, 2023Updated 3 years ago
- Demo Spark application to transform data gathered on sensors for a heatmap application☆33May 29, 2017Updated 8 years ago
- Apache Spark - A unified analytics engine for large-scale data processing☆42,810Updated this week
- Code tests for future integration into Radar Legislativo☆21Jul 29, 2022Updated 3 years ago
- The directed brute force cracking tool, after collecting information, uses it to generate a special dictionary containing the feature inf…☆37Aug 7, 2023Updated 2 years ago
- This is a pytcli. (A command line for python toollib package)☆108Jul 9, 2022Updated 3 years ago
- This project integrates the SpringBoot + Mybatis + SparkSQL + Hive frameworks and supports Kerberos authentication.☆21Mar 19, 2018Updated 7 years ago