namebrandon / Sparkov
Markov Chain based fraud detection system in Spark.
☆9Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for Sparkov
- A real time streaming implementation of markov chain based fraud detection☆24Updated 9 years ago
- Set of Hadoop, Spark and Storm based tools for web and customer analytic☆34Updated 3 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated last year
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 4 years ago
- A framework to benchmark different graph databases, based on generated data from customizable schema, distribution, and size.☆26Updated 5 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 2 years ago
- Anomaly Detection model uses Spark for training and Spark Streaming for testing☆66Updated 8 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 5 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 7 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 6 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆62Updated 8 years ago
- Hadoop, Spark and Storm based anomaly detection implementations for data quality, cyber security, fraud detection etc.☆129Updated 10 months ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 7 years ago
- Machine Learning for Cascading☆82Updated 9 years ago
- A collection of datasets and databases☆24Updated 6 years ago
- Anomaly detection framework @ PayPal☆107Updated 5 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 9 years ago
- ☆38Updated 8 years ago
- Big Data Science Swiss Army Knife - http://www.tuktu.io --☆60Updated 6 years ago
- Infrastructure setup.☆11Updated 5 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- VoltDB Click Stream Processing Example.☆16Updated 6 years ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 7 years ago
- ☆8Updated 6 years ago
- An example PySpark project with pytest☆17Updated 7 years ago
- Data Catalog for Databases and Data Warehouses☆31Updated 10 months ago
- Set of real time stream processing algorithms that can be used by big data streaming platform☆72Updated 4 years ago
- Security log file challenge☆28Updated 8 years ago
- The sane way of building a data layer in Airflow☆24Updated 4 years ago