Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
☆21Jan 30, 2019Updated 7 years ago
Alternatives and similar repositories for live_log_analyzer_spark
Users that are interested in live_log_analyzer_spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the first project where we worked on apache spark, In this project what we have done is that we downloaded the datasets from KAGG…☆23Oct 14, 2021Updated 4 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆17Dec 18, 2018Updated 7 years ago
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- ☆16May 29, 2023Updated 3 years ago
- Multiple coding projects completed in Python☆11Jun 10, 2014Updated 12 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Classification problem to predict loan defaulters using Lending Club Dataset☆11Jan 26, 2019Updated 7 years ago
- Rasa Chatbot using Django backend and Sockets for communication☆12Dec 8, 2022Updated 3 years ago
- ☆14Sep 14, 2021Updated 4 years ago
- Implementation of YOLO algorithm for real-time object detection and classification☆11Sep 13, 2019Updated 6 years ago
- A simple php toolbox to interact with the Microsoft Azure Search Service REST API.☆11Feb 2, 2023Updated 3 years ago
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…☆11Mar 9, 2022Updated 4 years ago
- ☆11Jun 15, 2019Updated 7 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Dec 12, 2018Updated 7 years ago
- A data engineering pipeline for digital marketers.☆11Dec 21, 2018Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- PredictorFinc is a scalable supervised machine learning model the predicts stock price change through Decision Tree Regressor using data …☆12Sep 5, 2023Updated 2 years ago
- Simple Twitter bot using Tweepy and Python☆16Jan 20, 2017Updated 9 years ago
- Docker template for basic data science packages to interface with Neo4j☆14Nov 8, 2021Updated 4 years ago
- A Python script to swoop and decrypt passwords from Chrome's local storage.☆11Dec 10, 2018Updated 7 years ago
- A fast and low memory requirement version of PointHop and PointHop++, which is built upon Apache Spark.☆10Jul 14, 2020Updated 5 years ago
- A Serverless function for posting to a Slack Webhook in response to a Mailgun route☆11Oct 12, 2016Updated 9 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆104Dec 3, 2020Updated 5 years ago
- Spark Projects for the Berkeley Data Science Course☆13Aug 12, 2015Updated 10 years ago
- ☆10Apr 3, 2019Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Python scripts for Agisoft Photoscan☆12Jun 18, 2015Updated 11 years ago
- Python solutions to problems posted on http://codility.com/☆11Nov 13, 2013Updated 12 years ago
- Dump the saved wifi passwords for windows using regular expressions and python 3☆17Dec 22, 2016Updated 9 years ago
- A flask based python app offering flight recommendations☆10Sep 26, 2016Updated 9 years ago
- Serverless function to automate enforcement of Multi-Factor Authentication (MFA) to all AWS IAM users with access to AWS Management Conso…☆13Oct 30, 2018Updated 7 years ago
- Notes, Ideas, and Projects related to my Springboard data science career track☆11Jun 23, 2017Updated 8 years ago
- Analyzing Big Data with Amazon EMR☆12Sep 14, 2020Updated 5 years ago
- This is a pipeline of an ETL application in GCP with open airport code data, which you can find here: https://datahub.io/core/airport-cod…☆15Nov 15, 2021Updated 4 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀☆12Feb 28, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Marshmallow serializer integration with pyspark☆12Dec 29, 2023Updated 2 years ago
- We store attacks and exploits that we've found useful in our research☆13Jun 4, 2015Updated 11 years ago
- noiseprint2 is a porting of noiseprint to tensorflow 2 and keras☆12Feb 20, 2021Updated 5 years ago
- Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset☆15Jul 16, 2017Updated 8 years ago
- Flask based Web application for predicting the income of a person☆13Dec 23, 2018Updated 7 years ago
- Google Home assistant for music recommendations, built with Python & Flask. Using Google Home and API.ai☆15Dec 19, 2017Updated 8 years ago
- Mobile robot data were analyzed with Apache-Spark to extract five different statistical result such as travel time, waiting time, average…☆15Apr 5, 2022Updated 4 years ago