rickyriled / data_engineering_project_1
My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform
☆14Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for data_engineering_project_1
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆22Updated 2 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆24Updated last year
- Sample project to demonstrate data engineering best practices☆167Updated 9 months ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆18Updated 2 months ago
- Data Engineering Project to Extract and Process Solana Reddit Data☆23Updated 9 months ago
- End to end data engineering project☆51Updated 2 years ago
- Project for "Data pipeline design patterns" blog.☆41Updated 3 months ago
- ☆105Updated 3 months ago
- ☆16Updated last year
- Data pipeline that scrapes Rust cheater Steam profiles☆51Updated 2 years ago
- This repo will guide you step-by-step method to create star schema dimensional model.☆24Updated 3 years ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆113Updated 4 months ago
- Code for "Advanced data transformations in SQL" free live workshop☆67Updated last month
- Pipeline that extracts data from the Spotify API to build a more detailed version of Spotify Wrapped☆31Updated last year
- Code for dbt tutorial☆143Updated 5 months ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆71Updated last year
- Simple stream processing pipeline☆92Updated 5 months ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆123Updated 2 years ago
- Processing TfL data for bike usage with Google Cloud Platform.☆42Updated 2 years ago
- ☆94Updated 2 years ago
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit☆19Updated 2 years ago
- Repository for Data Engineering Zoomcamp 2024☆13Updated 8 months ago
- Template for Data Engineering and Data Pipeline projects☆104Updated last year
- Macros for generating dbt model data profiles☆81Updated this week
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆56Updated 5 months ago
- A curated list of awesome public DBT projects☆95Updated 10 months ago
- Repo for saving cheat sheets☆42Updated 5 months ago
- ☆29Updated last week
- ☆70Updated last month