olalakul / Web-Server-Log-Analysis-PySpark
Playground for pyspark (RDDs, DStreams) and Apache Airflow. Based on the example of parsing (including incorrectly formated strings) web server log data
☆18Updated 3 years ago
Alternatives and similar repositories for Web-Server-Log-Analysis-PySpark:
Users that are interested in Web-Server-Log-Analysis-PySpark are comparing it to the libraries listed below
- Cyber Security for Big Data and IoT using Machine Learning☆14Updated 6 years ago
- Data warehouse implementation for an e-commerce website “Infibeam” that sells digital and consumer electronics.☆19Updated 7 years ago
- A complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki page…☆19Updated 5 years ago
- 4 different Big Datasets joined to get single table for final data analysis. Fraud Detection by taken consideration of different key feat…☆46Updated 4 years ago
- This project's aim was to implement various Recommendation Models on Hadoop Framework and to compare their performance.☆25Updated 7 years ago
- Multi-class classification model for predicting the types of crimes in Toronto☆14Updated last year
- ☆13Updated 2 years ago
- This repository contain Data Analysis on Black Friday Sales Data using various Regression ML algorithms☆16Updated 2 weeks ago
- Big Data webapp using Chicago street congestion, crashes, red light violations, and speed camera violations☆40Updated 4 years ago
- Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the H…☆21Updated 2 years ago
- ☆11Updated 2 years ago
- ☆11Updated 5 years ago
- Data cleaning, pre-processing, and Analytics on a Health care data using Spark and Python.☆48Updated last year
- ☆13Updated last year
- Detecting Frauds in Online Transactions using Anamoly Detection Techniques Such as Over Sampling and Under-Sampling as the ratio of Fraud…