namebrandon / Sparkov
Markov Chain based fraud detection system in Spark.
☆10Updated 8 years ago
Alternatives and similar repositories for Sparkov:
Users that are interested in Sparkov are comparing it to the libraries listed below
- Chatlytics is a data query and visualization platform for chat!☆13Updated 7 years ago
- A real time streaming implementation of markov chain based fraud detection☆24Updated 10 years ago
- A collection of datasets and databases☆24Updated 6 years ago
- Hadoop, Spark and Storm based anomaly detection implementations for data quality, cyber security, fraud detection etc.☆128Updated last year
- Anomaly Detection model uses Spark for training and Spark Streaming for testing☆67Updated 9 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 6 years ago
- Big Data Science Swiss Army Knife - http://www.tuktu.io --☆60Updated 6 years ago
- Spark Parameter Optimization and Tuning☆31Updated 6 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 4 years ago
- Temporal_Graph_library☆25Updated 5 years ago
- Set of Hadoop, Spark and Storm based tools for web and customer analytic☆34Updated 3 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- Time series analysis with Apache Spark based on Chronix |☆38Updated 7 years ago
- Mastering Spark for Data Science, published by Packt☆47Updated 2 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆61Updated 4 months ago
- Spark Tutorial at the University of Maryland☆38Updated 10 years ago
- Automatically loads new partitions in AWS Athena☆18Updated 4 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Sketching data structures for scala, including t-digest☆15Updated 3 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆62Updated 8 years ago
- A framework to benchmark different graph databases, based on generated data from customizable schema, distribution, and size.☆26Updated 6 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- Spark Application UI extension for JupyterLab☆10Updated 3 years ago
- Real-time query spark and visualise it as graph.☆24Updated 7 years ago
- Project for the talk on NLP using LSTM implementation from DL4J on Spark☆20Updated 8 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark.☆16Updated 8 years ago
- Provides a Pythonic interface for reading and writing Avro schemas☆27Updated 2 years ago
- A K8s-based infrastructure for analytics☆24Updated 5 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 5 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year