Joshua-omolewa / Retailstore_ETL_pipeline_projectLinks
Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and transforms the raw data (ETL process) using Apache spark to meet business requirements and also enables Data Analyst create Data Visualization using Superset. Airflow is used to orchestrate the pipeline
☆9Updated 2 years ago
Alternatives and similar repositories for Retailstore_ETL_pipeline_project
Users that are interested in Retailstore_ETL_pipeline_project are comparing it to the libraries listed below
Sorting:
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆23Updated 2 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆48Updated 5 years ago
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- ☆28Updated last year
- ☆21Updated last year
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆103Updated 4 years ago
- Simple ETL pipeline using Python☆26Updated 2 years ago
- Ravi Azure ADB ADF Repository☆66Updated 4 months ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR☆84Updated 5 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆147Updated 5 years ago
- Udacity Data Engineering Nanodegree Capstone Project☆36Updated 5 years ago
- YouTube tutorial project☆103Updated last year
- PySpark Projects☆23Updated 3 weeks ago
- ☆23Updated 2 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆160Updated 2 years ago
- For this project I am creating an ETL (Extract, Transform, and Load) pipeline using Python, RegEx, and SQL Database. The goal is to retri…☆27Updated 4 years ago
- All Data Engineering notebooks from Datacamp course☆115Updated 5 years ago
- ☆40Updated 11 months ago
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…☆57Updated 2 years ago
- PySpark Cheatsheet☆36Updated 2 years ago
- ☆35Updated 2 years ago
- Repository related to Spark SQL and Pyspark using Python3☆38Updated 3 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆23Updated 2 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago
- I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29Updated 2 years ago
- Classwork projects and home works done through Udacity data engineering nano degree☆10Updated 4 years ago
- Recohut - Learn data engineering, data science☆97Updated last year
- With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset wh…☆14Updated 3 years ago
- This repo contains commands that data engineers use in day to day work.☆61Updated 2 years ago
- Course Material Data Engineering on AWS Course☆29Updated 9 months ago