moritzkoerber / covid-19-data-engineering-pipeline
A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation and CDK, deployable via GitHub Actions.
☆23 · Updated last year
Alternatives and similar repositories for covid-19-data-engineering-pipeline:
Users who are interested in covid-19-data-engineering-pipeline are comparing it to the repositories listed below.
- End to end data engineering project ☆53 · Updated 2 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ☆78 · Updated last year
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆50 · Updated last year
- Cost Efficient Data Pipelines with DuckDB ☆51 · Updated 8 months ago
- Code for dbt tutorial ☆155 · Updated 10 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset ☆39 · Updated 4 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset ☆49 · Updated 4 months ago
- Code for my "Efficient Data Processing in SQL" book. ☆56 · Updated 7 months ago
- Generate DBT tests based on sample data ☆36 · Updated last year
- Project for "Data pipeline design patterns" blog. ☆45 · Updated 7 months ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces. ☆66 · Updated last year
- Repo for orienting dbt users to the Dagster asset framework ☆54 · Updated 2 years ago
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev ☆28 · Updated 2 months ago
- ☆75 · Updated 5 months ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆65 · Updated 6 months ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster ☆27 · Updated 2 years ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle… ☆89 · Updated this week
- Data-aware orchestration with dagster, dbt, and airbyte ☆31 · Updated 2 years ago
- build dw with dbt ☆43 · Updated 5 months ago
- dbt Project for Rapid Onboarding instructors to use in instruction and learners to reference throughout the course. ☆25 · Updated this week
- ☆33 · Updated 3 weeks ago
- Delta-Lake, ETL, Spark, Airflow ☆46 · Updated 2 years ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt ☆132 · Updated 8 months ago
- An example dbt project using AutomateDV to create a Data Vault 2.0 Data Warehouse based on the Snowflake TPC-H dataset. ☆46 · Updated last year
- Example repo to create end to end tests for data pipeline. ☆22 · Updated 9 months ago
- Some example projects for Data Engineers to build, end-to-end. ☆28 · Updated last year
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase ☆126 · Updated 2 years ago
- A repository of sample code to show data quality checking best practices using Airflow. ☆74 · Updated 2 years ago
- A template for dockerized dbt-Core projects with VS Code Dev Containers. ☆21 · Updated 2 years ago
- A data engineering project with Airflow, dbt, Terraform, GCP and much more! ☆24 · Updated 2 years ago