skoonData / apache-nifi
☆24Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for apache-nifi
- Multi-container environment with Hadoop, Spark and Hive☆203Updated 10 months ago
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines☆123Updated 2 years ago
- Youtube Apache NiFi 2022 Series resources☆71Updated last year
- ☆23Updated 3 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆39Updated 3 years ago
- Delta-Lake, ETL, Spark, Airflow☆44Updated 2 years ago
- This is a GitHub for all of my NiFi Templates☆43Updated 4 years ago
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 2 years ago
- Apache Spark 3 - Structured Streaming Course Material☆119Updated last year
- Spark Examples☆124Updated 2 years ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆32Updated 4 years ago
- Delta Lake examples☆207Updated last month
- ☆141Updated last year
- Docker with Airflow and Spark standalone cluster☆245Updated last year
- pyspark framework☆25Updated 2 years ago
- Simple stream processing pipeline☆92Updated 5 months ago
- Building a Data Pipeline with an Open Source Stack☆38Updated 4 months ago
- ☆86Updated 2 years ago
- This repo contains a spark standalone cluster on docker for anyone who wants to play with PySpark by submitting their applications.☆23Updated last year
- EverythingApacheNiFi☆102Updated last year
- The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Pos…☆55Updated last year
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆56Updated last year
- Code and Notebooks for Spark Tutorials for Learning Journal @ Youtube☆55Updated 4 years ago
- End to end data engineering project☆51Updated 2 years ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆21Updated 2 years ago
- Code snippets used in demos recorded for the blog.☆29Updated last month
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆37Updated last year
- ☆111Updated 4 years ago