degagawolde / data-warehouse-dbt-airflow-postgress
A data-warehouse built for the pNEUMA open dataset of naturalistic trajectories of half a million vehicles collected by a swarm of drones in a congested downtown area of Athens, Greece.
☆10Updated last year
Related projects: ⓘ
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆28Updated 5 months ago
- Code for my "Efficient Data Processing in SQL" book.☆47Updated last month
- Data Engineering with Databricks Cookbook, published by Packt☆26Updated 3 months ago
- Essential PySpark for Scalable Data Analytics, published by Packt☆43Updated last year
- Source code for 'Building a Data Warehouse' by Vincent Rainardi☆28Updated 7 years ago
- A Postgres data warehouse for processing synthetic data using IAC principles☆16Updated last year
- This repo contains all the material developed during the 9-week bootcamp provided by DPhi in colaboration with DataTalks Club☆21Updated 2 years ago
- build dw with dbt☆26Updated last month
- Duke MIDS: Data Engineering and DataOps Course☆55Updated last year
- DataTalks Workshop Materials☆18Updated 6 months ago
- Repository for Data Engineering Zoomcamp 2024☆13Updated 5 months ago
- Code for "Advanced data transformations in SQL" free live workshop☆54Updated last month
- Data Engineering with Scala, published by Packt☆16Updated 7 months ago
- ☆35Updated 2 months ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆20Updated 5 years ago
- Cost Efficient Data Pipelines with DuckDB☆42Updated last month
- Data pipeline that scrapes Rust cheater Steam profiles☆50Updated 2 years ago
- (Python, PySpark)☆11Updated 3 years ago
- Apache Spark 3 for Data Engineering and Analytics with Python , By Packt publishing☆21Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆47Updated 3 months ago
- Analytics engineering with dbt - projects and developer environment☆16Updated 3 months ago
- ☆11Updated 2 years ago
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb☆18Updated last year
- ☆17Updated last year
- PySpark Tutorial for Beginners on Google Colab: Hands-On Guide☆16Updated 4 years ago
- Code for data quality with greatexpectations blog☆10Updated last month
- ☆84Updated 2 years ago
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆12Updated last year
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆17Updated 2 years ago
- A pipeline to detect data drift and retrain the model when there is drift☆20Updated last year