Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines
☆134Nov 4, 2022Updated 3 years ago
Alternatives and similar repositories for docker-spark
Users that are interested in docker-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆508Nov 7, 2025Updated 6 months ago
- A simple spark standalone cluster for your testing environment purposses☆567Mar 6, 2024Updated 2 years ago
- Docker image for Spark history server on Kubernetes☆15Mar 13, 2020Updated 6 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆38Mar 29, 2021Updated 5 years ago
- Implementation of an ETL process for real-time sentiment analysis of tweets with Docker, Apache Kafka, Spark Streaming, MongoDB and Delta…☆19May 6, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- spark on kubernetes☆104Feb 20, 2023Updated 3 years ago
- running apache spark with docker swarm☆34Feb 25, 2021Updated 5 years ago
- MySQL Binlog based Changed Data Capture☆11Apr 26, 2017Updated 9 years ago
- Apache Spark docker image☆2,049Apr 20, 2026Updated last month
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Apr 2, 2022Updated 4 years ago
- Kafka streaming with Spark and Flink example☆31Jul 16, 2023Updated 2 years ago
- Tools and specifications for Semantic Data Dictionaries☆12May 21, 2026Updated last week
- ☆11Jul 13, 2020Updated 5 years ago
- ☆32Aug 13, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆29May 13, 2025Updated last year
- ☆18Apr 6, 2025Updated last year
- An example of Spark and GraphX with Twitter as sample☆19Dec 29, 2016Updated 9 years ago
- ☆13Feb 3, 2026Updated 3 months ago
- Create a streaming pipeline using Kafka and Kafka Connect☆14Jun 29, 2020Updated 5 years ago
- R model API to support bucketing and masking☆12Oct 9, 2018Updated 7 years ago
- ☆21May 13, 2025Updated last year
- Step by step tutorial for those who have zero knowledge to Amazon EKS☆14Mar 11, 2026Updated 2 months ago
- Udacity Data Engineering Nanodegree Project 3☆12Jul 14, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Provider for AWS Redshift entities, eg Users, Groups, Permissions, Schemas, Databases☆47Mar 10, 2022Updated 4 years ago
- ☆19Apr 5, 2023Updated 3 years ago
- Collection of AWS Lambda functions in Python☆11Mar 13, 2019Updated 7 years ago
- Autocomplete / Autofill Text field with Dropdown menu to choose between suggested values from a given list.☆14Feb 23, 2024Updated 2 years ago
- Source Code for 'Beginning Apache Spark 3' by Hien Luu☆13Oct 14, 2021Updated 4 years ago
- Generate cloud-init ready vm images via packer and deploy these via terraform.☆16Jan 6, 2026Updated 4 months ago
- Opinionated Devops for R Data Products Strictly Without Magic☆14Jan 20, 2025Updated last year
- Scripts and code written whilst learning and experimenting with machine learning☆13Jul 18, 2022Updated 3 years ago
- Publish your Kubernetes Helm Charts on GitHub Pages. DEPRECATED: please use https://github.com/helm/chart-releaser☆23Jul 4, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- CLI Tool for quickly loading file-based datasets into PostgreSQL/PostGIS☆12Apr 22, 2017Updated 9 years ago
- Some data science applications on the student mathematics performance data set from the 2010 KDD Cup.☆10Nov 27, 2014Updated 11 years ago
- Spark job for compacting avro files together☆12Jan 26, 2018Updated 8 years ago
- Build idbloader.img, trust.img, and uboot.img from compiled uboot☆10Feb 17, 2023Updated 3 years ago
- Seed CouchDB design documents☆11Apr 12, 2020Updated 6 years ago
- download the esri js api☆19Dec 18, 2015Updated 10 years ago
- Distributed Data Systems with Azure Databricks, published by Packt☆12Jan 18, 2023Updated 3 years ago