Joshua-omolewa / Retailstore_ETL_pipeline_projectLinks
Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and transforms the raw data (ETL process) using Apache spark to meet business requirements and also enables Data Analyst create Data Visualization using Superset. Airflow is used to orchestrate the pipeline
☆11Updated 2 years ago
Alternatives and similar repositories for Retailstore_ETL_pipeline_project
Users that are interested in Retailstore_ETL_pipeline_project are comparing it to the libraries listed below
Sorting:
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆164Updated 3 years ago
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆25Updated 2 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR☆88Updated 6 years ago
- ☆30Updated 2 years ago
- Classwork projects and home works done through Udacity data engineering nano degree☆75Updated 2 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆50Updated 6 years ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆107Updated 3 weeks ago
- ☆212Updated 2 years ago
- Simple ETL pipeline using Python☆29Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆199Updated last month
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.☆218Updated last year
- All Data Engineering notebooks from Datacamp course☆116Updated 6 years ago
- YouTube tutorial project☆107Updated 2 years ago
- I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆30Updated 2 years ago
- ☆22Updated 2 years ago
- This is an all-in-one repository for Data Engineers, ideal for beginners & interview preparation, which includes Python as the main Progr…☆31Updated 2 years ago
- ☆148Updated 3 years ago
- ☆21Updated 2 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆104Updated 5 years ago
- Data Engineering YouTube Analysis Project by Darshil Parmar☆221Updated 2 years ago
- Git Repository☆151Updated 3 weeks ago
- ☆36Updated 2 years ago
- Price Crawler - Tracking Price Inflation☆189Updated 5 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆160Updated 5 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 6 years ago
- ☆317Updated last year
- Ravi Azure ADB ADF Repository☆64Updated last year
- Udacity Data Engineering Nanodegree Capstone Project☆37Updated 5 years ago
- This is a template you can use for your next data engineering portfolio project.☆186Updated 4 years ago
- ☆163Updated 3 years ago