sarit-si / docker-airflow-pdi-01Links
Setup Airflow & Pentaho (with Carte) in separate Docker containers
☆14Updated 4 years ago
Alternatives and similar repositories for docker-airflow-pdi-01
Users that are interested in docker-airflow-pdi-01 are comparing it to the libraries listed below
Sorting:
- Execution of DBT models using Apache Airflow through Docker Compose☆126Updated 2 years ago
- Youtube Apache NiFi 2022 Series resources☆89Updated 2 years ago
- Collection of NiFi-related stuff☆25Updated 3 years ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆49Updated 2 years ago
- Pentaho plugin for Apache Airflow - Orquestate pentaho transformations and jobs from Airflow☆40Updated 2 months ago
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆503Updated last month
- Grafana dashboards and StatsD exporter config for Airflow monitoring☆289Updated last year
- Building a Data Pipeline with an Open Source Stack☆55Updated 5 months ago
- A repository of sample code to accompany our blog post on Airflow and dbt.☆181Updated 2 years ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆44Updated last year
- Dremio Container Tools☆164Updated 3 months ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆77Updated 4 years ago
- Delta Lake Documentation☆51Updated last year
- Fully reproducible, Dockerized, step-by-step, demo on how to stream tables from Postgres to Kafka/KSQL back to Postgres. Detailed blog p…☆152Updated 4 years ago
- An example dbt project using AutomateDV to create a Data Vault 2.0 Data Warehouse based on the Snowflake TPC-H dataset.☆55Updated last year
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 2 years ago
- ☆91Updated 3 years ago
- A self-contained dbt project for testing purposes☆512Updated last year
- ☆14Updated 2 years ago
- ☆46Updated 2 years ago
- dbt module for myBI connect☆13Updated 2 years ago
- This is a GitHub for all of my NiFi Templates☆47Updated 5 years ago
- Quick Guides from Dremio on Several topics☆79Updated last month
- Delta-Lake, ETL, Spark, Airflow☆48Updated 3 years ago
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines☆135Updated 3 years ago
- Delta Lake examples☆235Updated last year
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆38Updated 4 years ago
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,…☆28Updated 7 months ago
- Demo DAGs that show how to run dbt Core in Airflow using Cosmos☆65Updated 7 months ago