rickyriled / data_engineering_project_1
My first attempt at a rough ETL pipeline; technologies include Spark, GCS, Prefect orchestration, and Terraform
☆15Updated 2 years ago
Alternatives and similar repositories for data_engineering_project_1
Users who are interested in data_engineering_project_1 are comparing it to the repositories listed below
- A data engineering project with Airflow, dbt, Terraform, GCP and much more!☆25Updated 2 years ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Updated 9 months ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Updated 2 years ago
- End to end data engineering project☆56Updated 2 years ago
- Sample project to demonstrate data engineering best practices☆194Updated last year
- ☆110Updated 3 years ago
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit☆19Updated 2 years ago
- Code for "Advanced data transformations in SQL" free live workshop☆82Updated last month
- Code Repository for my 3rd Data Project.☆15Updated 2 years ago
- ☆37Updated 3 months ago
- Final project completed after participating in the Data Engineering Zoomcamp☆11Updated 3 years ago
- Code snippets for Data Engineering Design Patterns book☆119Updated 3 months ago
- Project for "Data pipeline design patterns" blog.☆45Updated 10 months ago
- A monorepo combining data modelling in dbt with data viz using Evidence☆23Updated 2 years ago
- Data pipeline that scrapes Rust cheater Steam profiles☆52Updated 3 years ago
- Code for dbt tutorial☆156Updated 3 weeks ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 10 months ago
- Example repo to create end to end tests for data pipeline.☆25Updated last year
- Data Engineering Project in GCP☆20Updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset☆43Updated 7 months ago
- ☆17Updated 2 years ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆117Updated 2 months ago
- ☆132Updated 11 months ago
- Apartments Data Pipeline using Airflow and Spark.☆21Updated 3 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆77Updated 2 years ago
- ☆28Updated last year
- Resources and projects from the Udacity Data Engineering with AWS Nanodegree programme☆25Updated 2 years ago
- Delta-Lake, ETL, Spark, Airflow☆47Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆53Updated 4 years ago
- ☆80Updated 8 months ago