linkedin / isolation-forest
A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scalable training and ONNX export for easy cross-platform inference.
☆233 Updated last month
Alternatives and similar repositories for isolation-forest:
Users who are interested in isolation-forest are comparing it to the libraries listed below:
- Isolation Forest on Spark ☆227 Updated 3 months ago
- Joblib Apache Spark Backend ☆244 Updated 5 months ago
- The Synthetic Minority Oversampling Technique (SMOTE) implemented in Spark. ☆49 Updated 6 years ago
- Spark implementation for computing Shapley values using Monte Carlo approximation ☆74 Updated last year
- A Scala feature transformation library for data science and machine learning ☆465 Updated 4 months ago
- HandySpark - bringing pandas-like capabilities to Spark dataframes ☆191 Updated 5 years ago
- Create HTML profiling reports from Apache Spark DataFrames ☆195 Updated 4 years ago
- Sample application running fbprophet using Spark ☆49 Updated 5 years ago
- Python library for converting Apache Spark ML pipelines to PMML ☆95 Updated last year
- Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks. ☆126 Updated 4 years ago
- MLeap: Deploy ML Pipelines to Production ☆1,504 Updated last month
- A scalable nearest neighbor search library in Apache Spark ☆259 Updated 5 years ago
- Train and run PyTorch models on Apache Spark. ☆340 Updated last year
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights into your Spark DataFrames. ☆101 Updated 5 years ago
- Common library for serving TensorFlow, XGBoost and scikit-learn models in production. ☆139 Updated last year
- Spark Implementation of Google Facets Overview https://github.com/PAIR-code/facets ☆54 Updated last year
- Resources for Data Science Process management ☆204 Updated 5 years ago
- Read and write TensorFlow TFRecord data from Apache Spark. ☆291 Updated 8 months ago
- Distributed scikit-learn meta-estimators in PySpark ☆285 Updated 9 months ago
- UpliftML: A Python Package for Scalable Uplift Modeling ☆320 Updated last year
- Java library and command-line application for converting XGBoost models to PMML ☆129 Updated last week
- The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning wo… ☆167 Updated last year
- Jupyter kernel for Scala and Spark ☆187 Updated last year
- Use iterative feature pruning to identify hierarchical clusters. ☆55 Updated 5 years ago
- Java library and command-line application for converting Apache Spark ML pipelines to PMML ☆268 Updated 2 weeks ago
- Java library and command-line application for converting Scikit-Learn pipelines to PMML ☆533 Updated last week
- Python library for converting Scikit-Learn pipelines to PMML ☆689 Updated last month
- Scala Aggregators used for ML Model metrics monitoring ☆91 Updated last year