moritzkoerber / covid-19-data-engineering-pipeline

A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation and CDK, deployable via Github Actions.
23Updated 11 months ago

Related projects

Alternatives and complementary repositories for covid-19-data-engineering-pipeline