A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scalable training and ONNX export for easy cross-platform inference.
☆252Feb 11, 2026Updated 3 weeks ago
Alternatives and similar repositories for isolation-forest
Users that are interested in isolation-forest are comparing it to the libraries listed below
Sorting:
- Isolation Forest on Spark☆233Oct 15, 2024Updated last year
- Google Maps geocoding library for Scala☆12Oct 12, 2019Updated 6 years ago
- ☆14Nov 27, 2025Updated 3 months ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29May 15, 2020Updated 5 years ago
- The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning wo…☆173Dec 19, 2025Updated 2 months ago
- N-dimensional / multi-dimensional arrays (tensors) in Scala 3. Think NumPy ndarray / PyTorch Tensor but type-safe over shapes, array/axis…☆47Dec 22, 2022Updated 3 years ago
- Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH☆30May 2, 2019Updated 6 years ago
- Machine Learning tools for Space Weather and Plasma Physics☆17Sep 1, 2022Updated 3 years ago
- Simple and Distributed Machine Learning☆5,201Feb 14, 2026Updated 3 weeks ago
- This toolkit provides an implementation of Modified Adsorption (MAD), a graph-based semi-supervised learning (SSL) algorithm.☆24Jun 20, 2017Updated 8 years ago
- Course materials for BANA 7052 (Applied Linear Regression) at UC☆15Oct 11, 2020Updated 5 years ago
- Fuzzy matching function in spark (https://spark-packages.org/package/itspawanbhardwaj/spark-fuzzy-matching)☆24Dec 30, 2019Updated 6 years ago
- Extended Isolation Forest for Anomaly Detection☆485Nov 3, 2023Updated 2 years ago
- Utilities to Retrieve Rulelists from Model Fits, Filter, Prune, Reorder and Predict on unseen data☆11Feb 4, 2025Updated last year
- Fast generalised linear models by sampling and one-step polishing☆20Sep 23, 2018Updated 7 years ago
- deep learning and scientific computing framework with native CPU and GPU backend for the Scala programming language☆30Apr 22, 2025Updated 10 months ago
- Implementation of algorithms from the paper "Globally-Consistent Rule-Based Summary-Explanations for Machine Learning Models: Application …☆25Jun 4, 2022Updated 3 years ago
- A Time Series Library for Apache Spark☆1,022Jul 3, 2020Updated 5 years ago
- Uses simple Bayesian conjugate prior update rules to calculate metrics for various marketing objectives☆11Oct 9, 2023Updated 2 years ago
- VecoLuc is a scalable vector search engine that leverages Apache Lucene and the JDK's incubator vector API for high-performance vector op…☆11Aug 22, 2024Updated last year
- Library for fast text representation and classification.☆10Apr 17, 2022Updated 3 years ago
- PMML scoring library for Scala☆66Oct 18, 2025Updated 4 months ago
- Simple implementations of forward- and backward-mode automatic differentation in Scala☆23Jun 21, 2018Updated 7 years ago
- qdapTools is an R package that contains tools associated with the qdap package that may be useful outside of the context of text analysis…☆15May 10, 2023Updated 2 years ago
- Fast streams for Scala 3☆57Feb 8, 2025Updated last year
- R package for weighted model metrics☆11Apr 12, 2025Updated 10 months ago
- Type-safe, high performance, distributed Neural networks in Scala☆29Nov 20, 2023Updated 2 years ago
- Rcpp Interface to mlpack (version 2.1.0 and up)☆24Jan 31, 2021Updated 5 years ago
- Build integrator for Java, Scala, Scala.macro, Scala.js, Scala.native, Eclipse and Maven.☆51May 2, 2019Updated 6 years ago
- (MLSys' 21) An Acceleration System for Large-scare Unsupervised Heterogeneous Outlier Detection (Anomaly Detection)☆391Mar 24, 2025Updated 11 months ago
- Some popular algorithms(dbscan,knn,fm etc.) on spark☆32May 29, 2018Updated 7 years ago
- A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques☆9,729Mar 1, 2026Updated last week
- A Scala feature transformation library for data science and machine learning☆474Feb 7, 2025Updated last year
- A scalable nearest neighbor search library in Apache Spark☆262Mar 29, 2019Updated 6 years ago
- Implementation of TANE for experimental purposes☆15Apr 29, 2022Updated 3 years ago
- Scala 3.x wrapper for Apache Flink☆50Mar 5, 2023Updated 3 years ago
- ☆12Dec 19, 2016Updated 9 years ago
- ☆13Oct 22, 2019Updated 6 years ago
- eXtreme RuleFit (sparse linear models on XGBoost ensembles)☆44Dec 17, 2025Updated 2 months ago