Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and transforms the raw data (ETL process) using Apache spark to meet business requirements and also enables Data Analyst create Data Visualization using Superset. Airflow is used to orchestrate the pipeline
☆13May 25, 2023Updated 3 years ago
Alternatives and similar repositories for Retailstore_ETL_pipeline_project
Users that are interested in Retailstore_ETL_pipeline_project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lecture Notes for DSML Jun22 Beginner's Intermediate module☆11Oct 14, 2022Updated 3 years ago
- ☆16Jun 18, 2024Updated last year
- Top Picks for Data Science Self-Study: From Newbies to Pros!☆11Apr 2, 2024Updated 2 years ago
- This checklist aims to be an exhaustive list of all elements you should consider when using Amazon Redshift.☆15Sep 21, 2020Updated 5 years ago
- 🍔 A flutter app that shows food categories with various meals in each category and full description for each meal☆12Jan 30, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- It consists of all code examples discussed as part of deep learning course taken at algorithmica☆11Oct 1, 2020Updated 5 years ago
- A modern Login UI made with Flutter. It has Welcome Page, Login Page and Sign Up page.☆14Aug 9, 2021Updated 4 years ago
- Udacity Data Engineer Nanodegree - Capstone project☆11Dec 19, 2019Updated 6 years ago
- My personal page, CV and blog☆15May 8, 2026Updated 3 weeks ago
- This project describes how to write full ETL data pipeline using spark.☆15Oct 15, 2022Updated 3 years ago
- Streaming analytics project with eventsim and Kafka☆13Dec 23, 2022Updated 3 years ago
- Clickstream Faker Provider for Python.☆11Apr 2, 2022Updated 4 years ago
- My documents for self-learning fundamental of Data engineering skills☆14Aug 5, 2023Updated 2 years ago
- Creating Data Pipelines with Apache Airflow to manage ETL from Amazon S3 into Amazon Redshift☆14Jun 12, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Built a real-time streaming pipeline to extract stock data, using Apache Nifi, Debezium, Kafka, and Spark Streaming. Loaded the transform…☆28Oct 13, 2023Updated 2 years ago
- Lập trình Python cho Máy học☆15Dec 25, 2021Updated 4 years ago
- A real-time financial data streaming pipeline and visualization platform using Apache Kafka, Cassandra, and Bokeh.☆16Oct 27, 2022Updated 3 years ago
- Interesting Public Datasets☆12Apr 28, 2023Updated 3 years ago
- This project provides valuable customer sentiment insights for Zomato by tracking and analyzing tweets related to their brand and service…☆14Aug 27, 2023Updated 2 years ago
- ☆21Jan 13, 2024Updated 2 years ago
- ☆11Dec 9, 2020Updated 5 years ago
- Code snippets from Jason Brownlee's ML and Deep Learning books.☆12Mar 22, 2017Updated 9 years ago
- This project was created as a personal learning process. It is a simple example implementation of Azure Devops & Nodejs Application (Angu…☆11Oct 10, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Large Language Models (LLMs) Learning Resources☆21Jun 16, 2024Updated last year
- Apache Spark using SQL☆14Aug 18, 2021Updated 4 years ago
- Data Structures in Python☆10May 18, 2026Updated last week
- It consists of all the code examples of big datascience course taken at algorithmica☆13Oct 7, 2018Updated 7 years ago
- This repository aims to develop a step-by-step tutorial on how to build a Kubeflow Pipeline from scratch in your local machine.☆42Jan 26, 2024Updated 2 years ago
- In this project I used ML modeling and data analysis to predict ad clicks and significantly improve ad campaign performance, resulting in…☆13Nov 6, 2023Updated 2 years ago
- ☆23Jan 22, 2018Updated 8 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆22Feb 3, 2026Updated 3 months ago
- People ask me about data science resources so I've curated some here: this is <<20% of the size of an 'awesome' list but has 80% of the v…☆11Jan 14, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Todo Flutter application with sqflite as a local database and bloc state management.☆14Sep 23, 2021Updated 4 years ago
- For this project I am creating an ETL (Extract, Transform, and Load) pipeline using Python, RegEx, and SQL Database. The goal is to retri…☆27Feb 9, 2021Updated 5 years ago
- Build Your Own Roadmap☆11Jul 8, 2020Updated 5 years ago
- A template for dockerized dbt-Core projects with VS Code Dev Containers.☆21Nov 14, 2022Updated 3 years ago
- This project contains my solution for all the data structures and algorithms on Algo Expert, Hackerrank and Leetcode. This repository is …☆10Jan 24, 2021Updated 5 years ago
- This is a list of my published works. For more details check my Google Scholar profile.☆10Oct 1, 2020Updated 5 years ago
- 100DaysOfCode☆11Aug 18, 2020Updated 5 years ago