skoonData / apache-nifi
☆26Updated 7 months ago
Alternatives and similar repositories for apache-nifi:
Users that are interested in apache-nifi are comparing it to the libraries listed below
- Youtube Apache NiFi 2022 Series resources☆81Updated last year
- apache-nifi-templates☆51Updated 3 years ago
- ☆11Updated 3 years ago
- This is a GitHub for all of my NiFi Templates☆46Updated 4 years ago
- The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Pos…☆61Updated 2 years ago
- This project demonstrates how to use Apache Airflow to submit jobs to Apache spark cluster in different programming laguages using Python…☆41Updated last year
- Building a Data Pipeline with an Open Source Stack☆50Updated 9 months ago
- Docker with Airflow and Spark standalone cluster☆253Updated last year
- Built a real-time streaming pipeline to extract stock data, using Apache Nifi, Debezium, Kafka, and Spark Streaming. Loaded the transform…☆26Updated last year
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆45Updated last year
- Delta-Lake, ETL, Spark, Airflow☆46Updated 2 years ago
- ☆45Updated 4 years ago
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines☆130Updated 2 years ago
- ☆36Updated 2 years ago
- Quick Guides from Dremio on Several topics☆69Updated 2 months ago
- Building a Modern Data Lake with Minio, Spark, Airflow via Docker.☆18Updated 10 months ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆53Updated last year
- This project shows how to capture changes from postgres database and stream them into kafka☆36Updated 10 months ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0☆65Updated 2 years ago
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 2 years ago
- ☆82Updated last month
- ☆87Updated 2 years ago
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆109Updated last month
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆27Updated last year
- Companion repository for the book 'Delta Lake Up and Running'☆46Updated 11 months ago
- Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand☆47Updated last year
- Spark all the ETL Pipelines☆32Updated last year
- Apache Spark 3 - Structured Streaming Course Material☆121Updated last year
- Stream processing with Azure Databricks☆138Updated 3 months ago
- Writes the CSV file to Postgres, read table and modify it. Write more tables to Postgres with Airflow.☆35Updated last year