olalakul / Web-Server-Log-Analysis-PySpark
Playground for PySpark (RDDs, DStreams) and Apache Airflow, based on an example of parsing web server log data (including incorrectly formatted strings)
☆16Updated 2 years ago
Alternatives and similar repositories for Web-Server-Log-Analysis-PySpark:
Users who are interested in Web-Server-Log-Analysis-PySpark are comparing it to the repositories listed below
- Data warehouse implementation for an e-commerce website “Infibeam” that sells digital and consumer electronics.☆19Updated 6 years ago
- Cyber Security for Big Data and IoT using Machine Learning☆14Updated 6 years ago
- A complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki page…☆18Updated 5 years ago
- Four different big datasets joined into a single table for final data analysis. Fraud detection by taking into consideration different key feat…☆46Updated 4 years ago
- ☆13Updated 2 years ago
- Big Data webapp using Chicago street congestion, crashes, red light violations, and speed camera violations☆41Updated 4 years ago
- Project - Data Processing and Analysis in Python Course☆41Updated 6 years ago
- Multi-class classification model for predicting the types of crimes in Toronto☆14Updated 10 months ago
- This project's aim was to implement various Recommendation Models on Hadoop Framework and to compare their performance.☆25Updated 7 years ago
- This project aims to leverage Amazon Web Services to create a trending YouTube videos analytics service. Project contains different data en…☆13Updated this week
- This repository contains data analysis on Black Friday sales data using various regression ML algorithms☆14Updated 4 years ago
- Big data projects implemented by Maniram yadav☆51Updated 6 years ago
- Analysis and Prediction of the Customer Churn Using Machine Learning Models (Highest Accuracy) and Plotly Library☆9Updated 2 years ago
- Introduction: In e-commerce companies such as online retailers, customer segmentation is necessary in order to understand customers' behaviors. I…☆10Updated 5 years ago
- Data cleaning, pre-processing, and analytics on health care data using Spark and Python.☆46Updated last year
- Data science virtual internship program by British Airways through Forage!☆35Updated 2 years ago
- My Graduate Capstone Project - This is a Product Recommendation System for a Local Wholesaler in India, using Python and Machine Learning…☆27Updated 3 years ago
- ☆21Updated last year
- Abstract. Main Objective: The main agenda of this project is: Perform extensive Exploratory Data Analys…☆31Updated 3 years ago
- Big Data Management and Analysis Final Project☆66Updated 6 years ago
- Lending Club Data Loan Default Prediction☆52Updated last year
- A Classification Problem which predicts if a loan will get approved or not.☆40Updated 2 years ago
- India-based hardware company Sales Insights - A data analysis project performed with Tableau & SQL☆37Updated 2 years ago
- You are opening a new store at a particular location. Given the store location, area, size and other params, predict the overall rev…☆27Updated last year
- Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the H…☆20Updated last year
- Heart Strokes Predictions ML Model In Production☆44Updated 2 years ago
- PySpark Projects☆24Updated last week
- Data Science Capstone Project Using Python and Tableau 10☆49Updated 2 years ago
- Solved end-to-end machine learning projects☆32Updated last year
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆34Updated last year