hjben / docker-server
Repository for building docker image, with open-source applications
☆26Updated last year
Alternatives and similar repositories for docker-server
Users that are interested in docker-server are comparing it to the libraries listed below
Sorting:
- The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Pos…☆64Updated 2 years ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆56Updated last year
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆46Updated last year
- Delta-Lake, ETL, Spark, Airflow☆47Updated 2 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆98Updated 2 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆69Updated 4 years ago
- spark on kubernetes☆105Updated 2 years ago
- Cluster in docker with Apache Atlas and a minimal Hadoop ecosystem to perform some basic experiments.☆26Updated 8 months ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 2 years ago
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines☆130Updated 2 years ago
- ☆14Updated 2 years ago
- ☆32Updated 7 years ago
- Spark and Hive docker containers sharing a common MySQL metastore☆26Updated 5 years ago
- apache-nifi-templates☆51Updated 4 years ago
- Apache Nifi Hello World Example☆22Updated 7 years ago
- Youtube Apache NiFi 2022 Series resources☆84Updated last year
- Assets used in Cloudera Tutorials☆19Updated 3 years ago
- Terraform / NiFi on the Google Cloud Platform☆28Updated 6 months ago
- ☆18Updated last year
- This is a GitHub for all of my NiFi Templates☆46Updated 4 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi☆62Updated last year
- ☆16Updated last year
- Apache Spark 3 - Structured Streaming Course Material☆122Updated last year
- Apche Spark Structured Streaming with Kafka using Python(PySpark)☆40Updated 6 years ago
- Setup Airflow & Pentaho (with Carte) in separate Docker containers☆14Updated 4 years ago
- Postgresql configured to work as metastore for Hive.☆32Updated 2 years ago
- Materials for the next course☆24Updated 2 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆38Updated 4 years ago
- Deploy your Spark Production Cluster on Kubernetes☆47Updated 4 years ago
- Building a Data Pipeline with an Open Source Stack☆54Updated 10 months ago