Playground for pyspark (RDDs, DStreams) and Apache Airflow. Based on the example of parsing (including incorrectly formated strings) web server log data
☆18Feb 21, 2022Updated 4 years ago
Alternatives and similar repositories for Web-Server-Log-Analysis-PySpark
Users that are interested in Web-Server-Log-Analysis-PySpark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data warehouse implementation for an e-commerce website “Infibeam” that sells digital and consumer electronics.☆23Jan 28, 2018Updated 8 years ago
- This is the LinkedIn Learning repository for Level Up: Python Data Acquisitions, Prep, & EDA.☆15Mar 4, 2025Updated last year
- Collection of all the mini projects made by me so far.☆10Jan 4, 2022Updated 4 years ago
- A web application which acts as an IoT device when loaded in a smart phone browser. The data from the sensors are then used for Anomaly d…☆11Feb 4, 2021Updated 5 years ago
- Forecasting Netflix Customer Retention based on Gaussian Process Regression☆14Jul 22, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Multiple coding projects completed in Python☆11Jun 10, 2014Updated 11 years ago
- Classification problem to predict loan defaulters using Lending Club Dataset☆11Jan 26, 2019Updated 7 years ago
- ☆11Feb 11, 2020Updated 6 years ago
- Tutorial to learn how to utilize NLP to analyze your own LinkedIn data.☆11Jul 28, 2021Updated 4 years ago
- Simple wrapper over SOLR to emulate Azure Search (for development only)☆12Jul 8, 2017Updated 8 years ago
- A simple php toolbox to interact with the Microsoft Azure Search Service REST API.☆11Feb 2, 2023Updated 3 years ago
- The repository contains my work on data analytics on Relevel provided dataset for resourceful insights for the marketing team, alongwith …☆11Aug 22, 2021Updated 4 years ago
- Detection of fine-grained emotions in texts☆12Apr 6, 2021Updated 5 years ago
- Data Engineering Project at Insight☆15Nov 17, 2015Updated 10 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Deep learning model of depression detection from activity sensor data☆14Dec 17, 2021Updated 4 years ago
- This project consists of advanced phishing detection using the BERT masked language model.☆29Jan 31, 2024Updated 2 years ago
- Case Studies and Projects in Machine Learning/EDA/DL☆24Jun 18, 2024Updated last year
- Infuse AI into your application. Create and deploy a customer churn prediction model with IBM Cloud Private for Data, Db2 Warehouse, Spar…☆18Sep 17, 2025Updated 7 months ago
- Content-based Movie Recommender☆16Aug 17, 2018Updated 7 years ago
- Anomaly detection training suite☆120Nov 10, 2015Updated 10 years ago
- A Data Visualization project on the French traffic accidents database☆19Aug 27, 2019Updated 6 years ago
- ☆13Jun 19, 2018Updated 7 years ago
- This repository applies Deep Learning techniques for depression detection in text, using LSTM, GRU, BiLSTM, BERT models, and a baseline F…☆19Jul 14, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository contain Data Analysis on Black Friday Sales Data using various Regression ML algorithms☆21Apr 8, 2025Updated last year
- Here's how to get DataQuest's Data Engineering Track missions' content to work on your localhost. Using data from my Valenbisi ARIMA mode…☆17Jul 17, 2018Updated 7 years ago
- A simple webapp for memorizing multiple choice answers☆16Mar 19, 2021Updated 5 years ago
- RealTime StockStream is a streamlined, simulation system for processing live stock market data. It uses Apache Kafka for data input, Apac…☆31Feb 18, 2025Updated last year
- Data visualisations in Power BI☆31Nov 14, 2021Updated 4 years ago
- ☆30Jan 17, 2023Updated 3 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆12Jul 16, 2019Updated 6 years ago
- Follow the Lumiata Tech Blog on Medium!☆21May 8, 2023Updated 2 years ago
- A project to detect accident and send notification to hospitals whenever a accident happens.☆21Mar 22, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ( These solutions tested on 4 node Hortonwork cluster on my laptop. Do not test on your production environment until you test... :)☆20Apr 18, 2020Updated 6 years ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆24Apr 27, 2023Updated 3 years ago
- Raspberry Pi streaming demo with Standalone Kafka☆22Jan 22, 2020Updated 6 years ago
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆33Nov 9, 2023Updated 2 years ago
- Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.☆21Jan 30, 2019Updated 7 years ago
- Demo code for the lightning talk on sentiment analysis in Pychennai meetup☆24Aug 10, 2017Updated 8 years ago
- A python script that uses the Tweepy library to pull Tweets from Twitter's Streaming API, and then stores the important fields in a Mongo…☆23Aug 19, 2014Updated 11 years ago