Docker multi-node Hadoop cluster with Spark 2.4.1 on Yarn
☆50 · Dec 7, 2020 · Updated 5 years ago
Alternatives and similar repositories for docker-spark-yarn-cluster
Users that are interested in docker-spark-yarn-cluster are comparing it to the libraries listed below.
- Apache Spark on Apache Yarn 2.6.0 cluster Docker image ☆12 · Oct 18, 2017 · Updated 8 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark. ☆67 · Feb 3, 2021 · Updated 5 years ago
- A Procedure To Create A Yarn Cluster Based on Docker, Run Spark, And Do TPC-DS Performance Test. ☆16 · Jan 3, 2024 · Updated 2 years ago
- Machine Learning with Scala Quick Start Guide, published by Packt ☆24 · Jul 20, 2023 · Updated 2 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file. ☆169 · Feb 4, 2021 · Updated 5 years ago
- Spark Streaming example project which pulls messages from Kafka and writes them to an HBase table. ☆11 · Jul 5, 2015 · Updated 10 years ago
- A dockerized small bigdata cluster to play with ☆13 · Jun 14, 2016 · Updated 10 years ago
- A Spark cluster setup running on Docker containers ☆61 · Dec 26, 2019 · Updated 6 years ago
- Apache Spark docker image ☆2,051 · Apr 20, 2026 · Updated last month
- A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark. ☆13 · Oct 27, 2021 · Updated 4 years ago
- A Hadoop cluster based on Docker, including Hive and Spark. ☆82 · Nov 13, 2022 · Updated 3 years ago
- An experimental Athena extension for DuckDB 🐤 ☆57 · Dec 31, 2024 · Updated last year
- A simple hotel reservation system ☆19 · Jan 20, 2022 · Updated 4 years ago
- Dockerizing an Apache Spark Standalone Cluster ☆42 · Jun 29, 2022 · Updated 3 years ago
- Scripts involved in getting data for ingredient substitutions ☆16 · Feb 14, 2023 · Updated 3 years ago
- k8s hadoop: quickly set up a hadoop/hbase/hive environment on k8s. An old project kept for personal use, picked up again to build a practice environment on this basis (with kafka/flink added) for Tencent TBDS training. ☆21 · Mar 21, 2021 · Updated 5 years ago
- Spark 3.0.0 Structured Streaming Kafka Avro Demo ☆15 · Apr 21, 2023 · Updated 3 years ago
- Spark and Hive docker containers sharing a common MySQL metastore ☆26 · Apr 17, 2020 · Updated 6 years ago
- ♟Play chess against real players in your terminal using Lichess ☆10 · May 27, 2020 · Updated 6 years ago
- "The history of astronomy is a history of receding horizons."― Edwin Powell Hubble ☆13 · Jan 20, 2021 · Updated 5 years ago
- Postgresql configured to work as metastore for Hive. ☆32 · Dec 16, 2022 · Updated 3 years ago
- Enter the world of big data with Spark SQL, using imooc (慕课网) log analysis as an example ☆15 · Apr 3, 2018 · Updated 8 years ago
- SegTrackDetect - A framework for ROI-based Tiny Object Detection at full resolution. ☆11 · Jan 29, 2025 · Updated last year
- Pandas Helper Library for reading and writing DataFrames from and to HBase. ☆10 · Mar 8, 2018 · Updated 8 years ago
- Orchestration, Management and Monitoring of Data Processing ☆11 · Updated this week
- List of cities in Israel ☆10 · Mar 5, 2018 · Updated 8 years ago
- Tuya WebRTC Web Sample ☆14 · Feb 9, 2023 · Updated 3 years ago
- Package Objects ☆12 · Jun 5, 2025 · Updated last year
- A sample project showing how to run a Spark Streaming app with Kafka in Docker ☆35 · Oct 25, 2017 · Updated 8 years ago
- Deploy a big data platform using Docker Compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc. ☆150 · Sep 23, 2024 · Updated last year
- A small project to allow publishing data to Apache Kafka, Apache Pulsar or any other target system ☆16 · Sep 21, 2020 · Updated 5 years ago
- Hadoop HBase use cases and examples, including MR, HBaseUtil... ☆35 · Sep 18, 2013 · Updated 12 years ago
- Manage RabbitMQ with Ansible ☆34 · Updated this week
- Simulator for HDD/SSD, derived from the CMU PDL DiskSim, with the SSD-add-on patch from Microsoft Research applied. ☆15 · Dec 30, 2019 · Updated 6 years ago
- Creates a custom RabbitMQ container with a preconfigured vhost (example-vhost), exchange (example-exchange), and queue (example-queue) wi… ☆31 · May 5, 2020 · Updated 6 years ago
- This repository is a mirror of git://git.kernel.dk/blktrace.git ☆15 · Jun 12, 2016 · Updated 10 years ago
- Work In Progress: GitHub API v3.0 implemented in R using the gh package ☆18 · Dec 7, 2023 · Updated 2 years ago
- Code and Manuscript for the ECCV 4th Workshop on Computer Vision for Art Analysis "Deep Transfer Learning for Art Classification Problem… ☆14 · Sep 6, 2018 · Updated 7 years ago
- slides and examples from talks ☆19 · Jun 14, 2018 · Updated 8 years ago