namebrandon / Sparkov
Markov Chain based fraud detection system in Spark.
☆12 · Updated 9 years ago
Alternatives and similar repositories for Sparkov
Users who are interested in Sparkov are comparing it to the repositories listed below.
- Detecting outliers in a dataset using Spark ☆41 · Updated 9 years ago
- Anomaly Detection model uses Spark for training and Spark Streaming for testing ☆67 · Updated 9 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆41 · Updated 2 years ago
- PySpark phonetic and string matching algorithms ☆39 · Updated last year
- A collection of datasets and databases ☆24 · Updated 7 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm ☆21 · Updated 4 years ago
- Examples for Fast Data Processing with Spark ☆59 · Updated 11 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013) ☆55 · Updated 3 years ago
- Hadoop, Spark and Storm based anomaly detection implementations for data quality, cyber security, fraud detection etc. ☆127 · Updated last year
- Labs and data files for a full-day Spark workshop ☆24 · Updated last month
- A K8s-based infrastructure for analytics ☆24 · Updated 5 years ago
- Real-time query spark and visualise it as graph. ☆24 · Updated 7 years ago
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services ☆26 · Updated 3 months ago
- ☆35 · Updated 9 years ago
- NiFi provenance reporting tasks ☆14 · Updated last year
- Example python spark machine learning on NYC taxi data ☆9 · Updated 10 years ago
- Example Tensorflow Processor using Java API for Apache NiFi 1.2 - 1.9.1+ ☆39 · Updated 5 years ago
- My data is bigger than your data! ☆39 · Updated 6 years ago
- A simple introduction to using spark ml pipelines ☆26 · Updated 7 years ago
- Example code for building your own MemSQL Streamliner Pipelines ☆23 · Updated 8 years ago
- Time series analysis with Apache Spark based on Chronix ☆38 · Updated 8 years ago
- graphx example ☆24 · Updated 9 years ago
- A real time streaming implementation of markov chain based fraud detection ☆23 · Updated 10 years ago
- Tools for faster and optimized interaction with Teradata and large datasets. ☆17 · Updated 6 years ago
- Streaming outlier analysis ☆14 · Updated 8 years ago
- Repository for the Spark-Vector connector ☆20 · Updated 3 years ago
- Demonstration code for MLeap, both Jupyter notebooks and projects ☆24 · Updated 5 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x ☆41 · Updated 5 years ago
- Scriptable scheduler for periodical Hadoop workflows ☆22 · Updated 7 years ago
- Binding the GDELT universe in a Spark environment ☆25 · Updated 2 years ago