titicaca / spark-iforestView external linksLinks
Isolation Forest on Spark
☆232Oct 15, 2024Updated last year
Alternatives and similar repositories for spark-iforest
Users that are interested in spark-iforest are comparing it to the libraries listed below
Sorting:
- ☆19Feb 3, 2018Updated 8 years ago
- Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH☆30May 2, 2019Updated 6 years ago
- Hybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark☆88Dec 27, 2018Updated 7 years ago
- Spark Time Series Set data analysis☆12Dec 14, 2020Updated 5 years ago
- k-Nearest Neighbors algorithm on Spark☆240Nov 14, 2023Updated 2 years ago
- An implementation of DBSCAN runing on top of Apache Spark☆182Jan 10, 2018Updated 8 years ago
- Apache Spark Scala utility to track data records during application execution☆11Jun 12, 2023Updated 2 years ago
- Extended Isolation Forest for Anomaly Detection☆484Nov 3, 2023Updated 2 years ago
- testing scikit-learn Isolation Forest☆77Apr 27, 2018Updated 7 years ago
- ☆21Mar 17, 2023Updated 2 years ago
- Distributed t-SNE via Apache Spark☆159Dec 9, 2017Updated 8 years ago
- Random Forests in Apache Spark☆71Jun 13, 2019Updated 6 years ago
- C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.☆130Jan 26, 2021Updated 5 years ago
- Distributed Graph Analytics (DGA) is a compendium of graph analytics written for Bulk-Synchronous-Parallel (BSP) processing frameworks su…☆175Jan 10, 2019Updated 7 years ago
- ☆11Dec 23, 2017Updated 8 years ago
- A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques☆9,710Jan 5, 2026Updated last month
- Simple and Distributed Machine Learning☆5,198Updated this week
- An "Efficient" Implementation of DBSCAN on PySpark☆29Jul 6, 2023Updated 2 years ago
- ☆14Aug 26, 2016Updated 9 years ago
- This repository contains my MSc dissertation project. Iti s an implementation of a streaming GMM algorithm in Spark.☆11Aug 25, 2018Updated 7 years ago
- DBSCAN implementation using Apache Spark☆48Feb 2, 2018Updated 8 years ago
- Spark-based approximate nearest neighbor search using locality-sensitive hashing☆104Jul 5, 2016Updated 9 years ago
- Scala library for fitting linear and generalised linear statistical models☆29Dec 29, 2024Updated last year
- Distributed Linear Programming Solver on top of Apache Spark☆79Jan 4, 2021Updated 5 years ago
- A scalable nearest neighbor search library in Apache Spark☆262Mar 29, 2019Updated 6 years ago
- Analyze Slack Channel history by counting threads with specified keywords☆12Feb 3, 2022Updated 4 years ago
- Reasonable API for serving TensorFlow models using Scala☆31Nov 2, 2017Updated 8 years ago
- Gaussian Process Classification and Regression on Apache Spark☆11Mar 29, 2021Updated 4 years ago
- ☆12Dec 6, 2016Updated 9 years ago
- type-class based data cleansing library for Apache Spark SQL☆78Jun 23, 2019Updated 6 years ago
- DBSCAN clustering algorithm on top of Apache Spark☆264Mar 28, 2018Updated 7 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29May 15, 2020Updated 5 years ago
- Project defining the docker image that will support examples of algorithms created in this organization☆13Oct 22, 2017Updated 8 years ago
- A tool to create Airflow RBAC roles with dag-level permissions from cli.☆13Sep 7, 2023Updated 2 years ago
- WWW 2018: Unsupervised Anomaly Detection via Variational Auto-Encoder for Seasonal KPIs in Web Applications☆478Mar 6, 2019Updated 6 years ago
- Supplementary material for ICDM 20 paper "COPOD: Copula-Based Outlier Detection"☆59Aug 31, 2020Updated 5 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- Toying around with the TensorFlow Java API☆19Dec 20, 2017Updated 8 years ago
- PMML evaluator library for Apache Spark☆97Feb 8, 2026Updated last week