Isolation Forest on Spark
☆233Oct 15, 2024Updated last year
Alternatives and similar repositories for spark-iforest
Users that are interested in spark-iforest are comparing it to the libraries listed below
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scal…☆252Feb 11, 2026Updated 3 weeks ago
- ☆19Feb 3, 2018Updated 8 years ago
- A parallel implementation of local outlier factor based on Spark☆17Jan 26, 2022Updated 4 years ago
- Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH☆30May 2, 2019Updated 6 years ago
- Hybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark☆88Dec 27, 2018Updated 7 years ago
- k-Nearest Neighbors algorithm on Spark☆240Nov 14, 2023Updated 2 years ago
- An implementation of DBSCAN running on top of Apache Spark☆183Jan 10, 2018Updated 8 years ago
- ☆21Mar 17, 2023Updated 2 years ago
- Distributed t-SNE via Apache Spark☆160Dec 9, 2017Updated 8 years ago
- Distributed Linear Programming Solver on top of Apache Spark☆80Jan 4, 2021Updated 5 years ago
- Random Forests in Apache Spark☆71Jun 13, 2019Updated 6 years ago
- C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.☆131Jan 26, 2021Updated 5 years ago
- Spark data profiling utilities☆23Nov 24, 2018Updated 7 years ago
- Transformer-based Conditional Generative Adversarial Network for Multivariate Time Series Generation (IWTA - PAKDD2023)☆11May 1, 2023Updated 2 years ago
- ☆11Dec 23, 2017Updated 8 years ago
- Simple and Distributed Machine Learning☆5,207Updated this week
- A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques☆9,742Mar 1, 2026Updated last week
- An "Efficient" Implementation of DBSCAN on PySpark☆29Jul 6, 2023Updated 2 years ago
- This repository contains my MSc dissertation project. It is an implementation of a streaming GMM algorithm in Spark.☆11Aug 25, 2018Updated 7 years ago
- Boiler plate code for Torch based ML projects☆10Jul 14, 2021Updated 4 years ago
- JupyterLab Notebook for Mesosphere DC/OS☆11Aug 6, 2019Updated 6 years ago
- IsolationForest with Sk-learn☆22Nov 20, 2019Updated 6 years ago
- Spark-based approximate nearest neighbor search using locality-sensitive hashing☆104Jul 5, 2016Updated 9 years ago
- Scala library for fitting linear and generalised linear statistical models☆29Dec 29, 2024Updated last year
- A scalable nearest neighbor search library in Apache Spark☆262Mar 29, 2019Updated 6 years ago
- Reasonable API for serving TensorFlow models using Scala☆31Nov 2, 2017Updated 8 years ago
- Analyze Slack Channel history by counting threads with specified keywords☆12Feb 3, 2022Updated 4 years ago
- ☆12Dec 6, 2016Updated 9 years ago
- AIS visualization from an interactive R and Shiny based web app using Material Design from Google.☆13Sep 13, 2018Updated 7 years ago
- Gaussian Process Classification and Regression on Apache Spark☆11Mar 29, 2021Updated 4 years ago
- type-class based data cleansing library for Apache Spark SQL☆78Jun 23, 2019Updated 6 years ago
- DBSCAN clustering algorithm on top of Apache Spark☆264Mar 28, 2018Updated 7 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29May 15, 2020Updated 5 years ago
- Python + Numpy implementation of the Gene Expression Programming Evolutionary Algorithm☆11Sep 18, 2017Updated 8 years ago
- Converts 3D file formats to Minecraft schematics☆14Mar 8, 2013Updated 13 years ago
- Project defining the docker image that will support examples of algorithms created in this organization☆13Oct 22, 2017Updated 8 years ago
- Docker images to build and generate native artifacts using GraalVM☆59May 15, 2018Updated 7 years ago
- Fast bottom up trend reversal detection algorithm.☆14Oct 1, 2020Updated 5 years ago
- A tool to create Airflow RBAC roles with dag-level permissions from cli.☆13Sep 7, 2023Updated 2 years ago