namebrandon / SparkovLinks
Markov Chain based fraud detection system in Spark.
☆13Updated 9 years ago
Alternatives and similar repositories for Sparkov
Users that are interested in Sparkov are comparing it to the libraries listed below
Sorting:
- Anomaly detection framework @ PayPal☆108Updated 6 years ago
- Hadoop, Spark and Storm based anomaly detection implementations for data quality, cyber security, fraud detection etc.☆127Updated last year
- A real time streaming implementation of markov chain based fraud detection☆23Updated 10 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 5 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 7 years ago
- A simple demonstration of sub-sequence sampling as used for anomaly detection with EKG signals☆103Updated 4 years ago
- Detecting outliers in a dataset using Spark☆41Updated 9 years ago
- Some class materials for a data processing course using PySpark☆52Updated 2 years ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 8 years ago
- FluRS: A Python library for streaming recommendation algorithms☆109Updated 3 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- CLI tool to launch Spark jobs on AWS EMR☆67Updated last year
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Str…☆109Updated last year
- Examples for Fast Data Processing with Spark☆59Updated 12 years ago
- Set of real time stream processing algorithms that can be used by big data streaming platform☆72Updated 2 months ago
- Anomaly Detection model uses Spark for training and Spark Streaming for testing☆67Updated 9 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 8 years ago
- Small Docker image with Python Machine Learning tools (~180MB) https://hub.docker.com/r/frolvlad/alpine-python-machinelearning/☆81Updated 5 months ago
- NiFi provenance reporting tasks☆14Updated 2 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Use Kafka and Apache Spark streaming to perform click stream analytics☆76Updated 5 years ago
- Spark Tutorial at the University of Maryland☆38Updated 10 years ago
- Automatically loads new partitions in AWS Athena☆19Updated 5 years ago
- CrowdRec reference framework☆32Updated 8 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- Presentations and other resources.☆36Updated 5 years ago
- Geo-Located Data: Extracting Patterns from Mobile Data using Scikit-Learn and Cassandra☆29Updated 7 years ago
- ☆61Updated 9 years ago