fareespatel / E-Commerce-Datawarehouse-implementation
Data warehouse implementation for an e-commerce website “Infibeam” that sells digital and consumer electronics.
☆18Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for E-Commerce-Datawarehouse-implementation
- Big Data webapp using Chicago street congestion, crashes, red light violations, and speed camera violations☆38Updated 3 years ago
- Playground for pyspark (RDDs, DStreams) and Apache Airflow. Based on the example of parsing (including incorrectly formated strings) web …☆16Updated 2 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆32Updated 11 months ago
- Project - Data Processing and Analysis in Python Course☆41Updated 6 years ago
- 4 different Big Datasets joined to get single table for final data analysis. Fraud Detection by taken consideration of different key feat…☆44Updated 4 years ago
- ☆12Updated 2 years ago
- This project's aim was to implement various Recommendation Models on Hadoop Framework and to compare their performance.☆24Updated 6 years ago
- I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29Updated last year
- Data cleaning, pre-processing, and Analytics on a Health care data using Spark and Python.☆45Updated last year
- YouTube tutorial project☆94Updated last year
- Cyber Security for Big Data and IoT using Machine Learning☆14Updated 5 years ago
- Simple ETL pipeline using Python☆21Updated last year
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆63Updated last year
- ☆27Updated last year
- My Graduate Capstone Project - This is a Product Recommendation System for a Local Wholesaler in India, using Python and Machine Learning…☆28Updated 3 years ago
- This project introduces PySpark, a powerful open-source framework for distributed data processing. We explore its architecture, component…☆22Updated last month
- This repository contain Data Analysis on Black Friday Sales Data using various Regression ML algorithms☆14Updated 4 years ago
- A complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki page…☆17Updated 5 years ago
- This data project can be used as a take-home assignment to learn Pyspark and Data Engineering.☆13Updated last year
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆21Updated 2 years ago
- A end-to-end real-time stock market data pipeline with Python, AWS EC2, Apache Kafka, and Cassandra Data is processed on AWS EC2 with Apa…☆22Updated last year
- data-warehouse-snowflake-for-data-engineering☆14Updated last year
- ☆86Updated 2 years ago
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆21Updated last year
- Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3☆26Updated 3 years ago
- ☆21Updated last year
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago
- An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD…☆10Updated last year
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆21Updated last year
- Multi-class classification model for predicting the types of crimes in Toronto☆13Updated 8 months ago