abifet / moa
MOA is an open-source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection, and recommender systems) and tools for evaluation.
☆12 · Updated 6 years ago
Alternatives and similar repositories for moa:
Users who are interested in moa are comparing it to the libraries listed below:
- DEBS 2015 - Realtime Analytics Patterns with WSO2 CEP, Siddhi & Apache Storm ☆16 · Updated 2 years ago
- A collection of data sets for stream learning. ☆32 · Updated 4 years ago
- Code for machine learning workshop given to Sanger Systems group ☆40 · Updated 9 years ago
- Final project for my MSc in Data Science. This is a library for Data Pre-processing Algorithms for Streaming in Flink (DPASF) ☆18 · Updated 5 years ago
- Notebooks for nlp-on-spark ☆13 · Updated 8 years ago
- Real-time log event processing using Spark, Kafka & Cassandra ☆13 · Updated 10 years ago
- ADWIN is an adaptive sliding window algorithm for detecting change and keeping updated statistics from a data stream, and use it as a bla… ☆39 · Updated 7 years ago
- Files submitted to KDD 2018 for EFDT paper ☆22 · Updated 6 years ago
- Anomaly Detection model uses Spark for training and Spark Streaming for testing ☆67 · Updated 9 years ago
- Machine learning enhancements to Spark MLlib ☆20 · Updated 10 years ago
- KDD Hands-On Tutorial (2018) ☆29 · Updated 2 years ago
- Implementation of an online learning algorithm to do classification under concept drift ☆23 · Updated 7 years ago
- Tensor-based Spectral LDA on Spark ☆18 · Updated 6 years ago
- One of the ancestors of River ☆37 · Updated 4 years ago
- PySpark Machine Learning Examples ☆44 · Updated 7 years ago
- A catalog of Jupyter Notebooks presenting new techniques to interpret black box machine learning models. ☆15 · Updated 6 years ago
- The agent code to pair with the article on ideaheap. ☆10 · Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 · Updated 8 years ago
- Online Time Series Anomaly Detectors ☆29 · Updated 2 years ago
- Latest version of GoFFish Distributed Graph Processing Platforms ☆12 · Updated 6 years ago
- My Master's thesis on distributed deep learning (parallelizing gradient descent) and other concepts I explored during my research. ☆26 · Updated 7 years ago
- RapidMiner Extension for Anomaly Detection ☆94 · Updated 6 years ago
- Feature selection methods as Spark MLlib Pipelines ☆30 · Updated 6 years ago
- Awesome Distributed Machine Learning Frameworks ☆31 · Updated 7 years ago
- This code shows how to train a model in Amazon SageMaker using a custom loss function for a binary classification problem in which the co… ☆13 · Updated 6 years ago
- Hadoop, Spark and Storm based anomaly detection implementations for data quality, cyber security, fraud detection etc. ☆128 · Updated last year
- Real-time and offline time series analysis with Spark, Spark Streaming and Storm ☆21 · Updated 4 years ago