Playground for pyspark (RDDs, DStreams) and Apache Airflow, based on an example of parsing web server log data, including incorrectly formatted strings.
☆18 · Feb 21, 2022 · Updated 4 years ago
Alternatives and similar repositories for Web-Server-Log-Analysis-PySpark
Users that are interested in Web-Server-Log-Analysis-PySpark are comparing it to the libraries listed below.
- A complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki page… ☆19 · Oct 16, 2019 · Updated 6 years ago
- This project's aim was to implement various Recommendation Models on Hadoop Framework and to compare their performance. ☆25 · Dec 14, 2017 · Updated 8 years ago
- The smart city reference pipeline shows how to integrate various media building blocks, with analytics powered by the OpenVINO™ Toolkit, … ☆218 · May 5, 2025 · Updated last year
- ☆12 · Jul 22, 2025 · Updated 10 months ago
- This is the LinkedIn Learning repository for Level Up: Python Data Acquisitions, Prep, & EDA. ☆15 · Mar 4, 2025 · Updated last year
- Collection of all the mini projects made by me so far. ☆10 · Jan 4, 2022 · Updated 4 years ago
- Original Caliburn project from codeplex ☆16 · Feb 21, 2020 · Updated 6 years ago
- A curated list of awesome iOS application security resources. ☆11 · Jan 5, 2024 · Updated 2 years ago
- ☆11 · Feb 11, 2020 · Updated 6 years ago
- Simple wrapper over SOLR to emulate Azure Search (for development only) ☆12 · Jul 8, 2017 · Updated 8 years ago
- End-to-end data engineering pipeline with various technologies to ingest real time data. ☆27 · Nov 3, 2023 · Updated 2 years ago
- Data Engineering Project at Insight ☆15 · Nov 17, 2015 · Updated 10 years ago
- ☆17 · Jan 23, 2021 · Updated 5 years ago
- A highly scalable real-time log anomaly detection architecture with LLMs, information retrieval, and user feedback to pinpoint faults acr… ☆20 · Apr 27, 2024 · Updated 2 years ago
- Analysis and Prediction of the Customer Churn Using Machine Learning Models (Highest Accuracy) and Plotly Library ☆14 · Jan 5, 2023 · Updated 3 years ago
- Deep learning model of depression detection from activity sensor data ☆14 · Dec 17, 2021 · Updated 4 years ago
- Flask based Web application for predicting the income of a person ☆13 · Dec 23, 2018 · Updated 7 years ago
- Dapplo.CaliburnMicro is a Caliburn bootstrapper (and more) to quickly start with a WPF MVVM Application ☆21 · May 14, 2021 · Updated 5 years ago
- This repo is for the LinkedIn Learning course: Testing Python Data Science Code ☆21 · Sep 26, 2025 · Updated 7 months ago
- This project consists of advanced phishing detection using the BERT masked language model. ☆29 · Jan 31, 2024 · Updated 2 years ago
- Infuse AI into your application. Create and deploy a customer churn prediction model with IBM Cloud Private for Data, Db2 Warehouse, Spar… ☆18 · Sep 17, 2025 · Updated 8 months ago
- Anomaly detection training suite ☆120 · Nov 10, 2015 · Updated 10 years ago
- Content-based Movie Recommender ☆16 · Aug 17, 2018 · Updated 7 years ago
- Changing the styles of furniture and walls of bathroom images using machine learning techniques. This means, we are able to obtain an ima… ☆17 · Jul 26, 2019 · Updated 6 years ago
- ☆13 · Jun 19, 2018 · Updated 7 years ago
- A python script for importing data into Firebase Cloud Firestore ☆17 · Jun 30, 2023 · Updated 2 years ago
- VBA code of worksheet functions for linear and bilinear interpolation based on interp1 and interp2 in MATLAB ☆28 · Aug 24, 2021 · Updated 4 years ago
- iTASK - Intelligent Traffic Analysis Software Kit ☆30 · Dec 8, 2022 · Updated 3 years ago
- Series on Tensorflow starting from the basics and working our way up to more complex models ☆18 · Sep 1, 2018 · Updated 7 years ago
- RealTime StockStream is a streamlined, simulation system for processing live stock market data. It uses Apache Kafka for data input, Apac… ☆31 · Feb 18, 2025 · Updated last year
- Building data models, data warehouses, and data lakes; automating data pipelines; and working with massive datasets. ☆12 · Jul 16, 2019 · Updated 6 years ago
- Tutorial for creating a simple storage contract using Ethers. ☆21 · Aug 28, 2022 · Updated 3 years ago
- Implementation of a system capable of encryption and decryption of multimedia data (Text, Images, Videos, Audio etc.) using a hybrid mode… ☆22 · Feb 7, 2024 · Updated 2 years ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on. ☆24 · Apr 27, 2023 · Updated 3 years ago
- Simple way to send ether. ☆24 · Nov 23, 2020 · Updated 5 years ago
- Pyspark Spotify ETL ☆17 · Aug 19, 2021 · Updated 4 years ago
- Spark application for analyzing Apache access logs and detecting anomalies, with an accompanying Medium article. ☆21 · Jan 30, 2019 · Updated 7 years ago
- ☆25 · Oct 13, 2019 · Updated 6 years ago
- Video surveillance units are usually the first element of a security system. While they are the most intuitive to understand and can be p… ☆36 · Oct 18, 2014 · Updated 11 years ago