andreichiro / data_engineer_end2endLinks
End-to-end data engineer project
☆22Updated 2 years ago
Alternatives and similar repositories for data_engineer_end2end
Users that are interested in data_engineer_end2end are comparing it to the libraries listed below
Sorting:
- A project portfolio to accompany my resume☆29Updated 2 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆23Updated 3 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆44Updated 2 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆22Updated 2 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆92Updated last year
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆79Updated 2 years ago
- Sample project to demonstrate data engineering best practices☆203Updated last year
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆30Updated 2 years ago
- Code for "Advanced data transformations in SQL" free live workshop☆89Updated 8 months ago
- ☆36Updated 2 years ago
- ☆146Updated last year
- End-to-end ELT data engineering project☆22Updated 3 years ago
- Project for "Data pipeline design patterns" blog.☆49Updated last year
- capstone project for Dataengineer.io bootcamp Public Repo☆12Updated last year
- ☆30Updated 2 years ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆41Updated last year
- A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Doc…☆22Updated last year
- Data pipeline that scrapes Rust cheater Steam profiles☆54Updated 3 years ago
- Code for dbt tutorial☆166Updated 4 months ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆244Updated 3 years ago
- Fully dockerized Data Warehouse (DWH) using Airflow, dbt, PostgreSQL and dashboard using redash☆25Updated 3 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆144Updated 2 years ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Updated 4 months ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆281Updated last year
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Updated 3 years ago
- ☆15Updated last year
- ☆19Updated last year
- ☆147Updated 2 years ago
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆27Updated 2 years ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆107Updated 9 months ago