PierreKieffer/docker-spark-yarn-cluster

Docker multi-nodes Hadoop cluster with Spark 2.4.1 on Yarn

☆50

Alternatives and similar repositories for docker-spark-yarn-cluster

Users that are interested in docker-spark-yarn-cluster are comparing it to the libraries listed below.

junk16 / spark-yarn-cluster
Apache Spark on Apache Yarn 2.6.0 cluster Docker image
☆12 · Oct 18, 2017 · Updated 8 years ago
panovvv / hadoop-hive-spark-docker
Base Docker image with just essentials: Hadoop, Hive and Spark.
☆67 · Feb 3, 2021 · Updated 5 years ago
mohsenasm / spark-on-yarn-cluster
A Procedure To Create A Yarn Cluster Based on Docker, Run Spark, And Do TPC-DS Performance Test.
☆16 · Jan 3, 2024 · Updated 2 years ago
mjmlbook / mastering-java-machine-learning
Experiments, results and additional material from "Mastering Java Machine Learning" (PACKT Publishing)
☆14 · Jul 10, 2017 · Updated 9 years ago
PacktPublishing / Machine-Learning-with-Scala-Quick-Start-Guide
Machine Learning with Scala Quick Start Guide, published by Packt
☆24 · Jul 20, 2023 · Updated 3 years ago
panovvv / bigdata-docker-compose
Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.
☆168 · Feb 4, 2021 · Updated 5 years ago
jay-johnson / kombu-and-pika-pub-sub-examples
Simple publisher and subscriber examples for Kombu and Pika with a RabbitMQ broker
☆10 · Mar 23, 2018 · Updated 8 years ago
oscar-martin / docker-spark-hbase-yarn
A dockerized small bigdata cluster to play with
☆13 · Jun 14, 2016 · Updated 10 years ago
pchanumolu / Spark-Streaming-Apache-Kafka-Apache-HBase
Spark Streaming example project which pulls messages from Kafka and write to HBase Table.
☆11 · Jul 5, 2015 · Updated 11 years ago
Accedo-Global-Solutions / accedo-control-sdk-js
Accedo Control SDK for Node.js and browsers
☆11 · Jul 17, 2023 · Updated 3 years ago
google / fhir-dbt-utils
Utility functions to support analytics over FHIR in BigQuery or Apache Spark
☆15 · Jan 8, 2024 · Updated 2 years ago
rubenafo / docker-spark-cluster
A Spark cluster setup running on Docker containers
☆61 · Dec 26, 2019 · Updated 6 years ago
crosscite / doi-metadata-search
Frontend for CrossRef and DataCite Metadata Search
☆12 · May 18, 2024 · Updated 2 years ago
big-data-europe / docker-spark
Apache Spark docker image
☆2,050 · Apr 20, 2026 · Updated 3 months ago
dacort / duckdb-athena-extension
An experimental Athena extension for DuckDB 🐤
☆57 · Dec 31, 2024 · Updated last year
mudassir0909 / stackoverflow-card
Unofficial embeddable Stackoverflow profile summary card
☆11 · Nov 19, 2022 · Updated 3 years ago
Vinayak-D / HotelReservation
A simple hotel reservation system
☆19 · Jan 20, 2022 · Updated 4 years ago
amesar / docker-spark-hive-metastore
Spark and Hive docker containers sharing a common MySQL metastore
☆26 · Apr 17, 2020 · Updated 6 years ago
alialamiidrissi / ADA_Course_Project
☆11 · Dec 19, 2017 · Updated 8 years ago
cclient / kubernetes-hadoop
k8s hadoop: quickly set up a hadoop/hbase/hive environment on k8s. A very old project for my own use; for Tencent TBDS training I built a practice environment on top of it (adding kafka/flink), so I picked it up again.
☆21 · Mar 21, 2021 · Updated 5 years ago
lrjxgl / mysql8cn
MySQL 8.0 official manual documentation; MySQL 8.0 Chinese manual documentation
☆11 · Jun 4, 2019 · Updated 7 years ago
HsiehShuJeng / cdk-emrserverless-with-delta-lake
This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…
☆11 · Nov 18, 2025 · Updated 8 months ago
ornicar / lichs
♟Play chess against real players in your terminal using Lichess
☆10 · May 27, 2020 · Updated 6 years ago
big-data-europe / docker-hive-metastore-postgresql
Postgresql configured to work as metastore for Hive.
☆32 · Dec 16, 2022 · Updated 3 years ago
xuanzhao / imooc_spark_log_analysis
Enter the world of big data Spark SQL, using imooc log analysis as an example
☆15 · Apr 3, 2018 · Updated 8 years ago
enkhalifapro / bigdata-all-in-one
Docker-compose contains the most common big data systems like: Apache Hadoop, Apache Hive, Apache Spark, Jupyter, Flink
☆28 · Oct 9, 2023 · Updated 2 years ago
bitlap / smt
🔧 Useless but cool.
☆14 · Mar 13, 2025 · Updated last year
tuya / webrtc-demo-go
Tuya WebRTC Web Sample
☆14 · Feb 9, 2023 · Updated 3 years ago
sciencepal / dockers
Code for docker images
☆39 · Apr 12, 2023 · Updated 3 years ago
moodymudskipper / pkg
Package Objects
☆12 · Jun 5, 2025 · Updated last year
royts / israel-cities
List of cities in Israel
☆10 · Mar 5, 2018 · Updated 8 years ago
PacktPublishing / Hands-On-Scala-Programming
Hands-On Scala Programming [Video], published by Packt
☆13 · Oct 31, 2022 · Updated 3 years ago
antlypls / spark-kafka-docker-demo
A sample project shows how to run Spark Streaming app with Kafka in Docker
☆35 · Oct 25, 2017 · Updated 8 years ago
spancer / bigdata-docker-compose
Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc.
☆150 · Sep 23, 2024 · Updated last year
lensesio / datagen
A small project to allow publishing data to Apache Kafka, Apache Pulsar or any other target system
☆16 · Sep 21, 2020 · Updated 5 years ago
mayanhui / hadoop-hbase-examples
hadoop hbase use case and examples, including MR,HBaseUtil...
☆35 · Sep 18, 2013 · Updated 12 years ago
ansible-collections / community.rabbitmq
Manage RabbitMQ with Ansible
☆34 · Jun 22, 2026 · Updated last month
ikram-shah / fhir-ai-and-openapi-chain
An application to interact with your FHIR API's using natural language query
☆16 · Jul 6, 2023 · Updated 3 years ago
aws-samples / aws-emr-serverless-using-terraform
☆15 · Dec 19, 2025 · Updated 7 months ago