Isolation Forest on Spark
☆236Oct 15, 2024Updated last year
Alternatives and similar repositories for spark-iforest
Users that are interested in spark-iforest are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A distributed Spark/Scala implementation of the isolation forest and extended isolation forest algorithms for unsupervised outlier detect…☆256Apr 18, 2026Updated 3 weeks ago
- A parallel implementation of local outlier factor based on Spark☆17Jan 26, 2022Updated 4 years ago
- ☆19Feb 3, 2018Updated 8 years ago
- Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH☆30May 2, 2019Updated 7 years ago
- 运用孤立森林异常检测算法,过滤渗透测试和性能测试过程中产生的异常数据☆58Jul 17, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An implementation of DBSCAN runing on top of Apache Spark☆183Jan 10, 2018Updated 8 years ago
- Spark Time Series Set data analysis☆12Dec 14, 2020Updated 5 years ago
- IsolationForest wiht Sk-learn☆22Nov 20, 2019Updated 6 years ago
- Distributed Linear Programming Solver on top of Apache Spark☆80Jan 4, 2021Updated 5 years ago
- Extended Isolation Forest for Anomaly Detection☆493Nov 3, 2023Updated 2 years ago
- k-Nearest Neighbors algorithm on Spark☆241Nov 14, 2023Updated 2 years ago
- Distributed Graph Analytics (DGA) is a compendium of graph analytics written for Bulk-Synchronous-Parallel (BSP) processing frameworks su…☆176Jan 10, 2019Updated 7 years ago
- Distributed t-SNE via Apache Spark☆159Dec 9, 2017Updated 8 years ago
- C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.☆132Jan 26, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- JupyterLab Notebook for Mesosphere DC/OS☆11Aug 6, 2019Updated 6 years ago
- ☆17Apr 8, 2019Updated 7 years ago
- A Python library for anomaly detection across tabular, time series, graph, text, and image data. 60+ detectors, benchmark-backed ADEngine…☆9,836Apr 16, 2026Updated 3 weeks ago
- Simple and Distributed Machine Learning☆5,226Apr 24, 2026Updated 2 weeks ago
- Apache Spark Scala utility to track data records during application execution☆11Jun 12, 2023Updated 2 years ago
- A scalable nearest neighbor search library in Apache Spark☆262Mar 29, 2019Updated 7 years ago
- CentOS based Docker container for Time Series Analysis and Modeling.☆22Sep 19, 2019Updated 6 years ago
- An "Efficient" Implementation of DBSCAN on PySpark☆29Jul 6, 2023Updated 2 years ago
- DBSCAN clustering algorithm on top of Apache Spark☆264Mar 28, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- type-class based data cleansing library for Apache Spark SQL☆78Jun 23, 2019Updated 6 years ago
- Generic Density Based Clustering☆23Jul 15, 2025Updated 9 months ago
- AIS visualization from an interactive R and Shiny based web app using Material Design from Google.☆13Sep 13, 2018Updated 7 years ago
- Supplementary material for ICDM 20 paper "COPOD: Copula-Based Outlier Detection"☆59Aug 31, 2020Updated 5 years ago
- Fast bottom up trend reversal detection algorithm.☆14Oct 1, 2020Updated 5 years ago
- Synthesis with Metaheuristics - Genetic Programming in Scala☆15Oct 4, 2019Updated 6 years ago
- Spark / graphX implementation of the distributed louvain modularity algorithm☆318Sep 2, 2020Updated 5 years ago
- Scala implementation of Histogrammar, with optional front-ends and back-ends as separate Maven projects.☆15Dec 29, 2023Updated 2 years ago
- Some popular algorithms(dbscan,knn,fm etc.) on spark☆32May 29, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This repository contains my MSc dissertation project. Iti s an implementation of a streaming GMM algorithm in Spark.☆11Aug 25, 2018Updated 7 years ago
- Record matching and entity resolution at scale in Spark☆36Oct 31, 2023Updated 2 years ago
- Spark data profiling utilities☆23Nov 24, 2018Updated 7 years ago
- Project defining the docker image that will support examples of algorithms created in this organization☆13Oct 22, 2017Updated 8 years ago
- Random Forests in Apache Spark☆72Jun 13, 2019Updated 6 years ago
- Spark-based approximate nearest neighbor search using locality-sensitive hashing☆105Jul 5, 2016Updated 9 years ago
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,855Jul 10, 2023Updated 2 years ago