Apache Spark docker image
☆2,052Apr 21, 2023Updated 2 years ago
Alternatives and similar repositories for docker-spark
Users that are interested in docker-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Apache Hadoop docker image☆2,318Feb 1, 2024Updated 2 years ago
- [EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook a…☆699Oct 1, 2020Updated 5 years ago
- A simple spark standalone cluster for your testing environment purposses☆568Mar 6, 2024Updated 2 years ago
- Apache Flink docker image☆196Jul 1, 2022Updated 3 years ago
- ☆1,079Jun 2, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Docker build for Apache Spark☆671Dec 30, 2021Updated 4 years ago
- ☆32Mar 7, 2018Updated 8 years ago
- ☆250Nov 15, 2022Updated 3 years ago
- Demo Spark application to transform data gathered on sensors for a heatmap application☆33May 29, 2017Updated 8 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆507Nov 7, 2025Updated 5 months ago
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.☆3,111Apr 6, 2026Updated last week
- Ready-to-run Docker images containing Jupyter applications☆8,427Updated this week
- ☆760Mar 11, 2021Updated 5 years ago
- Apache Spark - A unified analytics engine for large-scale data processing☆43,098Apr 9, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,746Updated this week
- Docker Apache Airflow☆3,807Mar 1, 2023Updated 3 years ago
- Dockerfile for Apache Kafka☆6,976May 8, 2024Updated last year
- ☆26Nov 22, 2022Updated 3 years ago
- Docker multi-nodes Hadoop cluster with Spark 2.4.1 on Yarn☆51Dec 7, 2020Updated 5 years ago
- 50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian,…☆1,376Feb 3, 2026Updated 2 months ago
- Spark Notebook docker image☆10Dec 29, 2017Updated 8 years ago
- Base classes to use when writing tests with Spark☆1,551Updated this week
- Hadoop docker image☆1,202Jun 25, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Apache Spark docker container image (Standalone mode)☆35Oct 16, 2020Updated 5 years ago
- Postgresql configured to work as metastore for Hive.☆32Dec 16, 2022Updated 3 years ago
- A curated list of awesome Apache Spark packages and resources.☆1,869Feb 27, 2026Updated last month
- Code for docker images☆39Apr 12, 2023Updated 3 years ago
- spark on kubernetes☆104Feb 20, 2023Updated 3 years ago
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines☆134Nov 4, 2022Updated 3 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆69Feb 3, 2021Updated 5 years ago
- Upserts, Deletes And Incremental Processing on Big Data.☆6,139Updated this week
- A connector for Spark that allows reading and writing to/from Redis cluster☆947Oct 22, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Docker image for Apache Spark☆76Nov 8, 2019Updated 6 years ago
- REST job server for Apache Spark☆2,845Mar 3, 2026Updated last month
- Jupyter magics and kernels for working with remote Spark clusters☆1,361Sep 9, 2025Updated 7 months ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆171Feb 4, 2021Updated 5 years ago
- The Metadata Platform for your Data and AI Stack☆11,775Apr 9, 2026Updated last week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows☆44,964Apr 9, 2026Updated last week
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,860Jul 10, 2023Updated 2 years ago