namebrandon / SparkovLinks
Markov Chain based fraud detection system in Spark.
☆14Updated 10 years ago
Alternatives and similar repositories for Sparkov
Users that are interested in Sparkov are comparing it to the libraries listed below
Sorting:
- Hadoop, Spark and Storm based anomaly detection implementations for data quality, cyber security, fraud detection etc.☆129Updated 2 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 7 years ago
- Anomaly detection framework @ PayPal☆108Updated 6 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 6 years ago
- Set of real time stream processing algorithms that can be used by big data streaming platform☆73Updated 7 months ago
- Anomaly Detection model uses Spark for training and Spark Streaming for testing☆68Updated 10 years ago
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- Some class materials for a data processing course using PySpark☆52Updated 3 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated 2 years ago
- A simple demonstration of sub-sequence sampling as used for anomaly detection with EKG signals☆102Updated 5 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆116Updated 4 years ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 8 years ago
- A collection of datasets and databases☆24Updated 7 years ago
- An example PySpark project with pytest☆18Updated 8 years ago
- ☆40Updated 9 years ago
- Use Kafka and Apache Spark streaming to perform click stream analytics☆76Updated 5 years ago
- Detecting outliers in a dataset using Spark☆41Updated 9 years ago
- Convert a CSV fle to ORCFile☆26Updated 6 years ago
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆53Updated 2 years ago
- PySpark phonetic and string matching algorithms☆41Updated last year
- Implementations of the Portable Format for Analytics (PFA)☆126Updated 3 years ago
- A framework to benchmark different graph databases, based on generated data from customizable schema, distribution, and size.☆25Updated 7 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 3 years ago
- Some AWS EMR examples☆16Updated 8 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Updated 4 years ago
- This project is created to promote and advocate the use of FOSS machine learning.☆47Updated 2 weeks ago
- These are some code examples☆56Updated 6 years ago
- Real-world Spark pipelines examples☆83Updated 7 years ago
- CLI tool to launch Spark jobs on AWS EMR☆67Updated 2 years ago
- Spark Tutorial at the University of Maryland☆38Updated 11 years ago