moritzkoerber / covid-19-data-engineering-pipelineLinks
A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation and CDK, deployable via Github Actions.
☆23Updated last year
Alternatives and similar repositories for covid-19-data-engineering-pipeline
Users that are interested in covid-19-data-engineering-pipeline are comparing it to the libraries listed below
Sorting:
- Code for dbt tutorial☆162Updated last month
 - ☆38Updated 7 months ago
 - Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆131Updated 3 years ago
 - Open Data Stack Projects: Examples of End to End Data Engineering Projects☆90Updated 2 years ago
 - Template for Data Engineering and Data Pipeline projects☆114Updated 2 years ago
 - This repository provides various demos/examples of using Snowpark for Python.☆284Updated last year
 - Step-by-step tutorial on building a Kimball dimensional model with dbt☆152Updated last year
 - End to end data engineering project☆57Updated 3 years ago
 - A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset☆46Updated 11 months ago
 - Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
 - ☆23Updated 3 months ago
 - Execution of DBT models using Apache Airflow through Docker Compose☆122Updated 2 years ago
 - A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆54Updated 3 weeks ago
 - A modern ELT demo using airbyte, dbt, snowflake and dagster☆28Updated 2 years ago
 - Data pipeline with dbt, Airflow, Great Expectations☆164Updated 4 years ago
 - A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆78Updated 2 years ago
 - end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence☆226Updated 3 weeks ago
 - Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆119Updated 7 months ago
 - ☆20Updated last year
 - An example dbt project using AutomateDV to create a Data Vault 2.0 Data Warehouse based on the Snowflake TPC-H dataset.☆55Updated last year
 - ☆80Updated last year
 - Repo for orienting dbt users to the Dagster asset framework☆55Updated 3 years ago
 - Simple stream processing pipeline☆110Updated last year
 - ☆80Updated last month
 - A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
 - A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆223Updated 6 months ago
 - Generate DBT tests based on sample data☆39Updated last year
 - Repo for saving cheat sheets☆61Updated last year
 - Python project template for Snowpark development☆79Updated 2 years ago
 - Example repository showing how to build a data platform with Prefect, dbt and Snowflake☆107Updated 2 years ago