andreichiro / data_engineer_end2end
End-to-end data engineer project
☆24 Updated 2 years ago
Alternatives and similar repositories for data_engineer_end2end
Users who are interested in data_engineer_end2end are comparing it to the repositories listed below.
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath… ☆22 Updated 2 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project ☆23 Updated 3 years ago
- This repository contains the code for a real-time election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr… ☆42 Updated last year
- A project portfolio to accompany my resume ☆29 Updated 2 years ago
- Capstone project for Dataengineer.io bootcamp (public repo) ☆12 Updated last year
- Streamlit dashboard over Superstore data stored in a Postgres Docker container, built with SQLAlchemy + Plotly Express ☆13 Updated last year
- Code for "Advanced data transformations in SQL" free live workshop ☆88 Updated 6 months ago
- A self-contained, ready-to-run Airflow ELT project. Can be run locally or within Codespaces. ☆79 Updated 2 years ago
- Get crypto data from an API and stream it to Kafka with Airflow. Write the data to MySQL and visualize it with Metabase ☆16 Updated 2 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/ ☆91 Updated last year
- End-to-end data engineering project with Kafka, Airflow, Spark, Postgres, and Docker. ☆106 Updated 8 months ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin… ☆15 Updated 4 years ago
- Sample project to demonstrate data engineering best practices ☆200 Updated last year
- ☆36 Updated 2 years ago
- ☆29 Updated 2 years ago
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc… ☆23 Updated 3 years ago
- A template repository to create a data project with IaC, CI/CD, data migrations, and testing ☆281 Updated last year
- This repo is meant to make it really easy to analyze the interplay between health and social media use. ☆46 Updated 3 years ago
- Repository for Data Engineering Interview Series ☆33 Updated last year
- Code for blog at https://www.startdataengineering.com/post/docker-for-de/ ☆40 Updated last year
- End-to-end ELT data engineering project ☆22 Updated 2 years ago
- Fully dockerized data warehouse (DWH) using Airflow, dbt, and PostgreSQL, with a dashboard built in Redash ☆24 Updated 3 years ago
- ☆15 Updated last year
- ☆142 Updated 2 years ago
- I will attempt to create my own Spotify Wrapped by collecting data from the Spotify API, perform transformations and create informative d… ☆74 Updated 2 years ago
- Uses Airflow, Postgres, Kafka, Spark, Cassandra, and GitHub Actions to establish an end-to-end data pipeline ☆29 Updated 2 years ago
- ☆19 Updated last year
- Writes a CSV file to Postgres, reads the table, and modifies it. Writes more tables to Postgres with Airflow. ☆37 Updated 2 years ago
- Data engineering examples for Airflow, Prefect; dbt for BigQuery, Redshift, ClickHouse, Postgres, DuckDB; PySpark for batch processing; K… ☆68 Updated 2 weeks ago
- Code for the Data Engineering Zoomcamp ☆20 Updated 2 years ago