Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines
☆134Nov 4, 2022Updated 3 years ago
Alternatives and similar repositories for docker-spark
Users that are interested in docker-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆507Nov 7, 2025Updated 5 months ago
- A simple spark standalone cluster for your testing environment purposses☆568Mar 6, 2024Updated 2 years ago
- Docker image for Spark history server on Kubernetes☆15Mar 13, 2020Updated 6 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆38Mar 29, 2021Updated 5 years ago
- Set up a 3 node spark cluster using docker containers☆34Mar 23, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- spark on kubernetes☆104Feb 20, 2023Updated 3 years ago
- running apache spark with docker swarm☆34Feb 25, 2021Updated 5 years ago
- Apache Spark docker image☆2,052Apr 21, 2023Updated 2 years ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Apr 2, 2022Updated 4 years ago
- Kafka streaming with Spark and Flink example☆31Jul 16, 2023Updated 2 years ago
- Ansible playbooks for deploying a 3 node Kubernetes cluster☆23Nov 24, 2023Updated 2 years ago
- ☆11Jul 13, 2020Updated 5 years ago
- Helm Charts to Deploy Apache Drill on Kubernetes☆17Jan 5, 2024Updated 2 years ago
- ☆28May 13, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A docker image for HDFS FileBrowser. Cloudera Hue with FileBrowser only.☆11Sep 20, 2018Updated 7 years ago
- ☆16Aug 23, 2022Updated 3 years ago
- An example of Spark and GraphX with Twitter as sample☆19Dec 29, 2016Updated 9 years ago
- Create a streaming pipeline using Kafka and Kafka Connect☆14Jun 29, 2020Updated 5 years ago
- R model API to support bucketing and masking☆12Oct 9, 2018Updated 7 years ago
- 使用容器搭建大数据架构微服务☆13Nov 28, 2017Updated 8 years ago
- Scrapes and Analyzes/Compares Nov 1 & Jun 7, 2015 General Election Results in Turkey☆12Nov 19, 2015Updated 10 years ago
- ☆21May 13, 2025Updated 11 months ago
- Use Airflow to pull in remote data via API, pub/sub, kinesis, s3 etc. and then store it in s3 for later consumption by other services.☆13Mar 14, 2022Updated 4 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆10Feb 19, 2022Updated 4 years ago
- Mirror of Apache livy (Incubating)☆13Feb 8, 2024Updated 2 years ago
- Autocomplete / Autofill Text field with Dropdown menu to choose between suggested values from a given list.☆14Feb 23, 2024Updated 2 years ago
- Source Code for 'Beginning Apache Spark 3' by Hien Luu☆13Oct 14, 2021Updated 4 years ago
- This is a demo project to compare two web scrapping frameworks, Playwright and Selenium and using the new Pipelining tool Dagster☆15Sep 9, 2021Updated 4 years ago
- Generate cloud-init ready vm images via packer and deploy these via terraform.☆16Jan 6, 2026Updated 3 months ago
- Opinionated Devops for R Data Products Strictly Without Magic☆14Jan 20, 2025Updated last year
- Backtesting.py is an open-source backtesting Python library that allows users to test their trading strategies via code.☆21Feb 18, 2024Updated 2 years ago
- CLI Tool for quickly loading file-based datasets into PostgreSQL/PostGIS☆12Apr 22, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Example of a Spring Boot project with a React frontend☆13Apr 5, 2019Updated 7 years ago
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆23May 14, 2022Updated 3 years ago
- Build idbloader.img, trust.img, and uboot.img from compiled uboot☆10Feb 17, 2023Updated 3 years ago
- Demo code showing how to use Java's StructuredTaskScope☆12Dec 10, 2025Updated 4 months ago
- Seed CouchDB design documents☆11Apr 12, 2020Updated 6 years ago
- download the esri js api☆19Dec 18, 2015Updated 10 years ago
- R package for prepping and analyzing DataHaven's 2019 Community Index☆12Jan 29, 2026Updated 2 months ago