namebrandon / Sparkov
Markov Chain based fraud detection system in Spark.
☆13 Updated 9 years ago
Alternatives and similar repositories for Sparkov
Users who are interested in Sparkov are comparing it to the libraries listed below.
- CLI tool to launch Spark jobs on AWS EMR ☆67 Updated 2 years ago
- Anomaly detection framework @ PayPal ☆108 Updated 6 years ago
- Dremio Flight connector. Access Dremio using Arrow flight ☆39 Updated 5 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆42 Updated 3 years ago
- Chatlytics is a data query and visualization platform for chat! ☆13 Updated 8 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x ☆41 Updated 6 years ago
- Examples for Fast Data Processing with Spark ☆59 Updated 12 years ago
- A collection of datasets and databases ☆24 Updated 7 years ago
- Hadoop, Spark and Storm based anomaly detection implementations for data quality, cyber security, fraud detection etc. ☆128 Updated last year
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to… ☆29 Updated last year
- PySpark phonetic and string matching algorithms ☆39 Updated last year
- Query testing framework ☆71 Updated 2 weeks ago
- Some AWS EMR examples ☆16 Updated 7 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra. ☆116 Updated 4 years ago
- Presentations and other resources. ☆36 Updated 5 years ago
- Tools for faster and optimized interaction with Teradata and large datasets. ☆17 Updated 7 years ago
- Time series analysis with Apache Spark based on Chronix ☆38 Updated 8 years ago
- A platform for real-time streaming search ☆102 Updated 9 years ago
- A simple demonstration of sub-sequence sampling as used for anomaly detection with EKG signals ☆102 Updated 5 years ago
- The sane way of building a data layer in Airflow ☆24 Updated 6 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013) ☆55 Updated 3 years ago
- A tool for anomaly detection over streaming data based on sentiment analysis ☆30 Updated 7 years ago
- A cookiecutter template for an elasticsearch ingest processor plugin ☆47 Updated 3 years ago
- A framework to benchmark different graph databases, based on generated data from customizable schema, distribution, and size. ☆25 Updated 6 years ago
- Chorus, now for Elasticsearch! ☆16 Updated last year
- Graph Analytics with Apache Kafka ☆106 Updated 2 weeks ago
- Some class materials for a data processing course using PySpark ☆52 Updated 3 years ago
- HopsWorks - Hadoop for Humans ☆117 Updated 6 years ago
- ☆12 Updated 6 years ago
- Spark Tutorial at the University of Maryland ☆38 Updated 11 years ago