Docker multi-nodes Hadoop cluster with Spark 2.4.1 on Yarn
☆51Dec 7, 2020Updated 5 years ago
Alternatives and similar repositories for docker-spark-yarn-cluster
Users that are interested in docker-spark-yarn-cluster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Apache Spark on Apache Yarn 2.6.0 cluster Docker image☆11Oct 18, 2017Updated 8 years ago
- Lets Airflow DAGs run Spark jobs via Livy: sessions and/or batches.☆18May 23, 2023Updated 2 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆69Feb 3, 2021Updated 5 years ago
- A ready to go Big Data cluster (Hadoop + Hadoop Streaming + Spark + PySpark) with Docker and Docker Swarm!☆23May 20, 2025Updated 10 months ago
- A Procedure To Create A Yarn Cluster Based on Docker, Run Spark, And Do TPC-DS Performance Test.☆16Jan 3, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Repository containing Docker images for Spark master and slave☆15Nov 3, 2019Updated 6 years ago
- Unit and integration testing with PySpark can be tough to figure out, let's make that easier.☆23Nov 3, 2015Updated 10 years ago
- Machine Learning with Scala Quick Start Guide, published by Packt☆24Jul 20, 2023Updated 2 years ago
- Simple publisher and subscriber examples for Kombu and Pika with a RabbitMQ broker☆10Mar 23, 2018Updated 8 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆171Feb 4, 2021Updated 5 years ago
- A dockerized small bigdata cluster to play with☆13Jun 14, 2016Updated 9 years ago
- Sammyjs.org☆19Jul 27, 2015Updated 10 years ago
- Kafka streaming with Spark and Flink example☆31Jul 16, 2023Updated 2 years ago
- Utility functions to support analytics over FHIR in BigQuery or Apache Spark☆15Jan 8, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Accedo Control SDK for Node.js and browsers☆11Jul 17, 2023Updated 2 years ago
- Repo for practical data science problems approaches, including notebook demo and working scripts | #DS | #analysis☆12Oct 13, 2020Updated 5 years ago
- A Spark cluster setup running on Docker containers☆61Dec 26, 2019Updated 6 years ago
- Apache Spark docker image☆2,052Apr 21, 2023Updated 2 years ago
- ☆15Sep 16, 2015Updated 10 years ago
- A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.☆13Oct 27, 2021Updated 4 years ago
- Unofficial embeddable Stackoverflow profile summary card☆11Nov 19, 2022Updated 3 years ago
- A Hadoop cluster based on Docker, including Hive and Spark.☆84Nov 13, 2022Updated 3 years ago
- ☆11Dec 19, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A simple hotel reservation system☆18Jan 20, 2022Updated 4 years ago
- Dockerizing an Apache Spark Standalone Cluster☆42Jun 29, 2022Updated 3 years ago
- Docker image for Apache Hive running on Tez☆25Apr 24, 2015Updated 10 years ago
- k8s hadoop,在k8s上快速搭建一个hadoop/hbase/hive环境,很早的项目自已用,腾讯tbds培训,以此为基础(多了一个kafka/flink)搭一套环境练习,又捡起来了☆22Mar 21, 2021Updated 5 years ago
- Scripts involved in getting data for ingredient substitutions☆16Feb 14, 2023Updated 3 years ago
- Spark 3.0.0 Structured Streaming Kafka Avro Demo☆15Apr 21, 2023Updated 2 years ago
- Spark and Hive docker containers sharing a common MySQL metastore☆26Apr 17, 2020Updated 6 years ago
- Extract FHIR data, Transform with NLP and DEID tools, and then Load FHIR data into a SQL Database for analysis☆22Updated this week
- "The history of astronomy is a history of receding horizons."― Edwin Powell Hubble☆12Jan 20, 2021Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 5 months ago
- Postgresql configured to work as metastore for Hive.☆32Dec 16, 2022Updated 3 years ago
- Java IDE Pack for VS Code - All Awesome extentions☆12Oct 12, 2018Updated 7 years ago
- SegTrackDetect - A framework for ROI-based Tiny Object Detection at full resolution.☆11Jan 29, 2025Updated last year
- Pandas Helper Library for reading and writing DataFrames from and to HBase.☆10Mar 8, 2018Updated 8 years ago
- Docker-compose contains the most common big data systems like: Apache Hadoop, Apache Hive, Apache Spark, Jupyter, Flink☆29Oct 9, 2023Updated 2 years ago
- list if cities in israel☆10Mar 5, 2018Updated 8 years ago