A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation and CDK, deployable via Github Actions.
☆23Nov 21, 2023Updated 2 years ago
Alternatives and similar repositories for covid-19-data-engineering-pipeline
Users that are interested in covid-19-data-engineering-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Modern Data Stack in a (Smaller) Box☆12Jan 28, 2023Updated 3 years ago
- CI/CD repository template to automate deployments of your production flows☆14Jul 1, 2024Updated last year
- Modern Data Stack in a box with dbt-duckdb and Apache Superset☆16Mar 5, 2026Updated 3 weeks ago
- Python package for Plotly/Dash apps with support for multi-page, modules, and new charts such as Pareto with an Object Orient Approach☆20Aug 5, 2022Updated 3 years ago
- example pipelines for deploying dbt via Azure DevOps pipelines☆21Apr 2, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- capstone project for Dataengineer.io bootcamp Public Repo☆12Feb 20, 2024Updated 2 years ago
- Capstone Project for the IBM Data Engineering Professional Certification.☆13Mar 7, 2022Updated 4 years ago
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Mar 26, 2025Updated last year
- Code that was used as an example during the Data+AI Summit 2020☆15Mar 8, 2021Updated 5 years ago
- Repository for the D ONE MLOps AWS BlogPost☆11Aug 13, 2024Updated last year
- This project implements a Lakehouse Medallion Architecture using modern Data Stack tools such as Fivetran, Snowflake and dbt. The fictici…