Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and transforms the raw data (ETL process) using Apache spark to meet business requirements and also enables Data Analyst create Data Visualization using Superset. Airflow is used to orchestrate the pipeline
☆12May 25, 2023Updated 2 years ago
Alternatives and similar repositories for Retailstore_ETL_pipeline_project
Users that are interested in Retailstore_ETL_pipeline_project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lecture Notes for DSML Jun22 Beginner's Intermediate module☆10Oct 14, 2022Updated 3 years ago
- ☆15Jun 18, 2024Updated last year
- Top Picks for Data Science Self-Study: From Newbies to Pros!☆11Apr 2, 2024Updated 2 years ago
- This checklist aims to be an exhaustive list of all elements you should consider when using Amazon Redshift.☆15Sep 21, 2020Updated 5 years ago
- 🍔 A flutter app that shows food categories with various meals in each category and full description for each meal☆12Jan 30, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- It consists of all code examples discussed as part of deep learning course taken at algorithmica☆11Oct 1, 2020Updated 5 years ago
- A modern Login UI made with Flutter. It has Welcome Page, Login Page and Sign Up page.☆14Aug 9, 2021Updated 4 years ago
- Udacity Data Engineer Nanodegree - Capstone project☆11Dec 19, 2019Updated 6 years ago
- My personal page, CV and blog☆15Apr 8, 2026Updated last week
- This project describes how to write full ETL data pipeline using spark.☆15Oct 15, 2022Updated 3 years ago
- Streaming analytics project with eventsim and Kafka☆13Dec 23, 2022Updated 3 years ago
- My documents for self-learning fundamental of Data engineering skills☆14Aug 5, 2023Updated 2 years ago
- Clickstream Faker Provider for Python.☆11Apr 2, 2022Updated 4 years ago
- Creating Data Pipelines with Apache Airflow to manage ETL from Amazon S3 into Amazon Redshift☆14Jun 12, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Built a real-time streaming pipeline to extract stock data, using Apache Nifi, Debezium, Kafka, and Spark Streaming. Loaded the transform…☆28Oct 13, 2023Updated 2 years ago
- Lập trình Python cho Máy học☆15Dec 25, 2021Updated 4 years ago
- A real-time financial data streaming pipeline and visualization platform using Apache Kafka, Cassandra, and Bokeh.☆16Oct 27, 2022Updated 3 years ago
- Interesting Public Datasets