andreichiro / data_engineer_end2end
End-to-end data engineer project
☆18Updated last year
Alternatives and similar repositories for data_engineer_end2end:
Users who are interested in data_engineer_end2end are comparing it to the libraries listed below
- build dw with dbt☆44Updated 6 months ago
- ☆40Updated 10 months ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 9 months ago
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆28Updated 2 years ago
- Tutorials/use cases of using Prefect in an ML project.☆41Updated 2 years ago
- Production ML rental prediction system.☆39Updated last year
- Data pipeline that scrapes Rust cheater Steam profiles☆52Updated 3 years ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆67Updated last year
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆27Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB☆52Updated 9 months ago
- Sample project to demonstrate data engineering best practices☆189Updated last year
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆32Updated last year
- This is a demo streaming project simulating a music streaming service.☆35Updated 8 months ago
- Creating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆14Updated last year
- Code for "Advanced data transformations in SQL" free live workshop☆79Updated this week
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago
- Step by step instructions to create a production-ready data pipeline☆48Updated 4 months ago
- ☆34Updated last year
- ☆17Updated 9 months ago
- Realtime Data Engineering Project☆29Updated 3 months ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆74Updated 11 months ago
- This is an overview of an MLOps architecture that includes both Airflow and MLflow running on separate Docker containers.☆21Updated 2 years ago
- Project for "Data pipeline design patterns" blog.☆45Updated 9 months ago
- Full stack data engineering tools and infrastructure set-up☆52Updated 4 years ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆37Updated last year
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆21Updated last year
- Streamlit application to explore Snowflake Tables☆40Updated last year
- A Postgres data warehouse for processing synthetic data using IAC principles☆17Updated 2 years ago
- This repo is meant to make it really easy to analyze the interplays between health and social media use.☆43Updated 2 years ago
- Code for dbt tutorial☆157Updated 11 months ago