olalakul / Web-Server-Log-Analysis-PySpark
Playground for PySpark (RDDs, DStreams) and Apache Airflow, based on the example of parsing web server log data (including incorrectly formatted strings)
☆18Updated 3 years ago
Alternatives and similar repositories for Web-Server-Log-Analysis-PySpark
Users that are interested in Web-Server-Log-Analysis-PySpark are comparing it to the libraries listed below
- Cyber Security for Big Data and IoT using Machine Learning☆15Updated 6 years ago
- Data warehouse implementation for an e-commerce website “Infibeam” that sells digital and consumer electronics.☆19Updated 7 years ago
- Four different big datasets joined into a single table for final data analysis. Fraud detection taking into consideration different key feat…☆46Updated 4 years ago
- A complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki page…☆19Updated 5 years ago
- This project's aim was to implement various Recommendation Models on Hadoop Framework and to compare their performance.☆25Updated 7 years ago
- Big data projects implemented by Maniram yadav☆51Updated 7 years ago
- Multi-class classification model for predicting the types of crimes in Toronto☆14Updated last year
- Big Data webapp using Chicago street congestion, crashes, red light violations, and speed camera violations☆40Updated 4 years ago
- Project - Data Processing and Analysis in Python Course☆41Updated 6 years ago
- This project aims to leverage Amazon Web Services to create trending Youtube videos analytics service. Project contains different data en…☆17Updated 3 weeks ago
- Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the H…☆21Updated 2 years ago
- This repository contains Data Analysis on Black Friday Sales Data using various Regression ML algorithms☆16Updated last month
- ☆11Updated 5 years ago
- A Classification Problem which predicts if a loan will get approved or not.☆40Updated 2 years ago
- Abstract. Main Objective: The main agenda of this project is: Perform extensive Exploratory Data Analys…☆32Updated 3 years ago
- Lending Club Data Loan Default Prediction☆55Updated 2 years ago
- Introduction: In e-commerce companies such as online retailers, customer segmentation is necessary in order to understand customers' behavior. I…☆10Updated 5 years ago
- Big Data Management and Analysis Final Project☆68Updated 7 years ago
- ☆21Updated 2 years ago
- ☆20Updated 11 months ago
- My Graduate Capstone Project - This is a Product Recommendation System for a Local Wholesaler in India, using Python and Machine Learning…☆28Updated 4 years ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆17Updated 2 years ago
- Analysing the content of an E-commerce database that contains list of purchases. Based on the analysis, I develop a model that allows to …☆134Updated 7 years ago
- Analysis and Prediction of the Customer Churn Using Machine Learning Models (Highest Accuracy) and Plotly Library☆10Updated 2 years ago
- Data Science Capstone Project Using Python and Tableau 10☆51Updated 2 years ago
- ☆16Updated 2 years ago
- Detecting Fraud in Online Transactions using Anomaly Detection Techniques Such as Over-Sampling and Under-Sampling as the ratio of Fraud…☆82Updated 5 years ago
- India based Hardware company Sales Insights - A Data Analysis Project performed on Tableau & SQL☆44Updated 2 years ago
- Customer Analytics for a FMCG company (K-means clustering, PCA, logistic regression, linear regression)☆16Updated 4 years ago
- Classification problem to predict loan defaulters using Lending Club Dataset☆11Updated 6 years ago