Multi-container environment with Hadoop, Spark and Hive
☆232 · May 5, 2025 · Updated 10 months ago
Alternatives and similar repositories for docker-hadoop-spark
Users who are interested in docker-hadoop-spark are comparing it to the repositories listed below
- ☆47 · Jul 4, 2023 · Updated 2 years ago
- Run Hadoop Cluster within Docker Containers. ☆16 · Mar 6, 2025 · Updated last year
- Hadoop-Hive-Spark cluster + Jupyter on Docker ☆84 · Jan 2, 2025 · Updated last year
- Dockerizing an Apache Spark Standalone Cluster ☆42 · Jun 29, 2022 · Updated 3 years ago
- Zeppelin docker ☆16 · Nov 16, 2020 · Updated 5 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file. ☆172 · Feb 4, 2021 · Updated 5 years ago
- Apache Hadoop docker image ☆2,313 · Feb 1, 2024 · Updated 2 years ago
- ☆21 · Mar 11, 2025 · Updated 11 months ago
- Toy Hadoop cluster combining various SQL-on-Hadoop variants ☆13 · Nov 16, 2017 · Updated 8 years ago
- Open episode of the data engineering practice course ☆32 · Jul 2, 2024 · Updated last year
- ☆14 · Mar 11, 2023 · Updated 2 years ago
- ☆13 · Feb 18, 2022 · Updated 4 years ago
- CI/CD platform using Jenkins, docker, Sonar, Nexus, Jmeter, Selenium, Ansible, AWX, Grafana, Prometheus, Zabbix, Stress-ng ☆21 · Feb 5, 2026 · Updated last month
- Big Data Docker Data Science Spark Spark4 Hadoop HDFS Scala Python Artificial Intelligence Machine Learning Jupyter Lab Notebook ☆19 · Feb 15, 2026 · Updated 3 weeks ago
- Docker with Airflow and Spark standalone cluster ☆263 · Aug 5, 2023 · Updated 2 years ago
- A demo of using Kafka, Spark, Hive, Cassandra, etc. with Docker. It produces a production-ready environment for any kind of big d… ☆37 · Sep 27, 2019 · Updated 6 years ago
- Docker for airflow with mysql as backend ☆12 · Nov 15, 2018 · Updated 7 years ago
- Apache Spark docker image ☆2,059 · Apr 21, 2023 · Updated 2 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark. ☆69 · Feb 3, 2021 · Updated 5 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment. ☆38 · Mar 29, 2021 · Updated 4 years ago
- [EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook a… ☆699 · Oct 1, 2020 · Updated 5 years ago
- Building Real Time Data Pipeline using Apache Kafka, Apache Spark, Hadoop, PostgreSQL, Django and Flexmonster on Docker to track status … ☆23 · Dec 29, 2020 · Updated 5 years ago
- Analytics Engineer Course ☆20 · May 17, 2023 · Updated 2 years ago
- Scraper for various science databases ☆11 · Sep 14, 2023 · Updated 2 years ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks ☆24 · Apr 2, 2022 · Updated 3 years ago
- Responsive registration form with HTML and CSS ☆11 · Oct 9, 2022 · Updated 3 years ago
- Docker Compose setup for Odoo with PostgreSQL and Nginx to easily spin up and develop your ERP ☆34 · Dec 15, 2023 · Updated 2 years ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data ☆50 · Dec 2, 2023 · Updated 2 years ago
- fedex-commercial-invoice ☆21 · Apr 28, 2016 · Updated 9 years ago
- Dockerfiles and Docker Compose for HDP 2.6 with Blueprints ☆23 · Jan 16, 2018 · Updated 8 years ago
- Docker with hadoop spark pig hive ☆26 · Jul 22, 2019 · Updated 6 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. ☆509 · Nov 7, 2025 · Updated 4 months ago
- Smart pet feeder using object detection and ESP32-cam ☆22 · May 19, 2024 · Updated last year
- Advent of code - 30 challenges for learning Dagster ☆27 · Dec 19, 2024 · Updated last year
- Reverse proxy in Rust, built on top of hyper ☆12 · Nov 8, 2022 · Updated 3 years ago
- Cloud-native Trino (prestosql) + Hive + Minio + Superset ☆24 · Nov 29, 2021 · Updated 4 years ago
- A simple Spark standalone cluster for your testing environment purposes ☆568 · Mar 6, 2024 · Updated 2 years ago
- Starting up a Kubernetes cluster with Vagrant, with Gluster, Portworx, Linstor, or StorageOS as storage provider and Traefik as ingress c… ☆11 · May 25, 2022 · Updated 3 years ago
- Node-RED Flow (and web page example) for the LLaMA AI model ☆11 · Jul 27, 2023 · Updated 2 years ago