Joshua-omolewa / Retailstore_ETL_pipeline_project
Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and transforms the raw data (ETL process) using Apache spark to meet business requirements and also enables Data Analyst create Data Visualization using Superset. Airflow is used to orchestrate the pipeline
☆10Updated last year
Alternatives and similar repositories for Retailstore_ETL_pipeline_project:
Users that are interested in Retailstore_ETL_pipeline_project are comparing it to the libraries listed below
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆107Updated 2 years ago
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆21Updated last year
- All Data Engineering notebooks from Datacamp course☆115Updated 5 years ago
- ☆28Updated last year
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- ☆19Updated last year
- This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science t…☆52Updated last month
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆43Updated 5 years ago
- ☆23Updated last year
- YouTube tutorial project☆98Updated last year
- Simple ETL pipeline using Python☆25Updated last year
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆34Updated last year
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆24Updated last year
- ☆19Updated last year
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆137Updated 4 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆22Updated 2 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR☆80Updated 5 years ago
- For this project I am creating an ETL (Extract, Transform, and Load) pipeline using Python, RegEx, and SQL Database. The goal is to retri…☆27Updated 4 years ago
- Data Engineering Project with Hadoop HDFS and Kafka☆46Updated last year
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆21Updated last year
- data-warehouse-snowflake-for-data-engineering☆14Updated last year
- ☆41Updated 7 months ago
- Real World Project on Formula1 Racing using Azure Databricks, Delta Lake and Azure Data Factory☆13Updated last year
- ☆87Updated 2 years ago
- Ravi Azure ADB ADF Repository☆66Updated 3 weeks ago
- Data Engineering Project in GCP☆18Updated last year
- Repository related to Spark SQL and Pyspark using Python3☆37Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆106Updated 9 months ago
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.☆119Updated 6 months ago
- For the Coursera specialization https://www.coursera.org/specializations/gcp-data-machine-learning☆47Updated 7 years ago