Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and transforms the raw data (ETL process) using Apache spark to meet business requirements and also enables Data Analyst create Data Visualization using Superset. Airflow is used to orchestrate the pipeline
☆12May 25, 2023Updated 2 years ago
Alternatives and similar repositories for Retailstore_ETL_pipeline_project
Users that are interested in Retailstore_ETL_pipeline_project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lecture Notes for DSML Jun22 Beginner's Intermediate module☆11Oct 14, 2022Updated 3 years ago
- ☆15Jun 18, 2024Updated last year
- Top Picks for Data Science Self-Study: From Newbies to Pros!☆11Apr 2, 2024Updated 2 years ago
- This checklist aims to be an exhaustive list of all elements you should consider when using Amazon Redshift.☆15Sep 21, 2020Updated 5 years ago
- 🍔 A flutter app that shows food categories with various meals in each category and full description for each meal☆12Jan 30, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- It consists of all code examples discussed as part of deep learning course taken at algorithmica☆11Oct 1, 2020Updated 5 years ago
- A modern Login UI made with Flutter. It has Welcome Page, Login Page and Sign Up page.☆14Aug 9, 2021Updated 4 years ago
- Udacity Data Engineer Nanodegree - Capstone project☆11Dec 19, 2019Updated 6 years ago
- My personal page, CV and blog☆15Apr 8, 2026Updated last month
- This project describes how to write full ETL data pipeline using spark.☆15Oct 15, 2022Updated 3 years ago
- Streaming analytics project with eventsim and Kafka☆13Dec 23, 2022Updated 3 years ago
- Clickstream Faker Provider for Python.☆11Apr 2, 2022Updated 4 years ago
- My documents for self-learning fundamental of Data engineering skills☆14Aug 5, 2023Updated 2 years ago
- Creating Data Pipelines with Apache Airflow to manage ETL from Amazon S3 into Amazon Redshift☆14Jun 12, 2019Updated 6 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Built a real-time streaming pipeline to extract stock data, using Apache Nifi, Debezium, Kafka, and Spark Streaming. Loaded the transform…☆28Oct 13, 2023Updated 2 years ago
- Lập trình Python cho Máy học☆15Dec 25, 2021Updated 4 years ago
- A real-time financial data streaming pipeline and visualization platform using Apache Kafka, Cassandra, and Bokeh.☆15Oct 27, 2022Updated 3 years ago
- Interesting Public Datasets☆12Apr 28, 2023Updated 3 years ago
- This project provides valuable customer sentiment insights for Zomato by tracking and analyzing tweets related to their brand and service…☆14Aug 27, 2023Updated 2 years ago
- ☆21Jan 13, 2024Updated 2 years ago
- ☆11Dec 9, 2020Updated 5 years ago
- Large Language Models (LLMs) Learning Resources☆20Jun 16, 2024Updated last year
- Code snippets from Jason Brownlee's ML and Deep Learning books.☆12Mar 22, 2017Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This project was created as a personal learning process. It is a simple example implementation of Azure Devops & Nodejs Application (Angu…☆11Oct 10, 2020Updated 5 years ago
- Apache Spark using SQL☆14Aug 18, 2021Updated 4 years ago
- Data Structures in Python☆10Apr 27, 2026Updated last week
- It consists of all the code examples of big datascience course taken at algorithmica☆13Oct 7, 2018Updated 7 years ago
- This repository aims to develop a step-by-step tutorial on how to build a Kubeflow Pipeline from scratch in your local machine.☆42Jan 26, 2024Updated 2 years ago
- In this project I used ML modeling and data analysis to predict ad clicks and significantly improve ad campaign performance, resulting in…☆13Nov 6, 2023Updated 2 years ago
- ☆22Jan 22, 2018Updated 8 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆22Feb 3, 2026Updated 3 months ago
- People ask me about data science resources so I've curated some here: this is <<20% of the size of an 'awesome' list but has 80% of the v…☆11Jan 14, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Todo Flutter application with sqflite as a local database and bloc state management.☆14Sep 23, 2021Updated 4 years ago
- For this project I am creating an ETL (Extract, Transform, and Load) pipeline using Python, RegEx, and SQL Database. The goal is to retri…☆26Feb 9, 2021Updated 5 years ago
- Build Your Own Roadmap☆11Jul 8, 2020Updated 5 years ago
- A template for dockerized dbt-Core projects with VS Code Dev Containers.☆21Nov 14, 2022Updated 3 years ago
- This project contains my solution for all the data structures and algorithms on Algo Expert, Hackerrank and Leetcode. This repository is …☆10Jan 24, 2021Updated 5 years ago
- This is a list of my published works. For more details check my Google Scholar profile.☆10Oct 1, 2020Updated 5 years ago
- 100DaysOfCode☆11Aug 18, 2020Updated 5 years ago