judeleonard / Prescriber-ETL-data-pipeline

An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS application using Apache Airflow as an orchestration tool and various data warehouse technologies and finally using Apache Superset to connect to DWH for generating BI dashboards for weekly reports
β˜†25Updated 2 years ago

Alternatives and similar repositories for Prescriber-ETL-data-pipeline:

Users that are interested in Prescriber-ETL-data-pipeline are comparing it to the libraries listed below