alessandrolulli / reforest
Random Forests in Apache Spark
☆71Updated 5 years ago
Alternatives and similar repositories for reforest:
Users that are interested in reforest are comparing it to the libraries listed below
- Spark Time Series Set data analysis☆12Updated 4 years ago
- C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.☆130Updated 4 years ago
- This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is base…☆134Updated 2 years ago
- Ensemble Learning for Apache Spark 🌲☆23Updated 5 months ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.☆65Updated 7 years ago
- Some Spark implementations of clustering algorithms.☆19Updated 6 years ago
- Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH☆30Updated 5 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 6 years ago
- k-Nearest Neighbors algorithm on Spark☆239Updated last year
- Building Annoy Index on Apache Spark☆72Updated 4 years ago
- Implementation of the Loopy Belief Propagation algorithm for Apache Spark☆41Updated 4 years ago
- SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.☆152Updated 4 years ago
- Isolation Forest on Spark☆227Updated 4 months ago
- Anomaly Detection model uses Spark for training and Spark Streaming for testing☆67Updated 9 years ago
- RapidMiner Extension for Anomaly Detection☆93Updated 5 years ago
- Featureselection methods as Spark MLlib Pipelines☆30Updated 6 years ago
- Map Reduce Implementation of Connected Component on Apache Spark☆84Updated 3 years ago
- ☆52Updated 7 years ago
- An improved implementation of the classical feature selection method: minimum Redundancy and Maximum Relevance (mRMR).☆83Updated 2 years ago
- KEEL: Knowledge Extraction based on Evolutionary Learning☆128Updated 6 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scal…☆237Updated 2 months ago
- SOUL: Scala Oversampling and Undersampling Library.☆13Updated 5 years ago
- ☆16Updated 9 years ago
- This package contains the code for executing clustering validity indices in Spark. The package includes BD-Silhouette, BD-Dunn, Davies-Bo…☆10Updated 6 years ago
- A Locality-Sensitive Hashing Library for Scala with optional Redis storage.☆16Updated 3 years ago
- A implementation of the Self-Tuning Spectral Clustering algorithm, and more.☆12Updated 8 years ago
- Distributed solver library for large-scale structured output prediction, based on Spark. Project website:☆17Updated 8 years ago
- The Synthetic Minority Oversampling Technique (SMOTE) implemented in Spark.☆49Updated 6 years ago
- Scala Library/REPL for Machine Learning Research☆201Updated last year