saurfang/spark-knn

saurfang / spark-knn

k-Nearest Neighbors algorithm on Spark

☆241

Alternatives and similar repositories for spark-knn

Users who are interested in spark-knn are comparing it to the libraries listed below.

karlhigley / spark-neighbors
Spark-based approximate nearest neighbor search using locality-sensitive hashing
☆104 · Jul 5, 2016 · Updated 10 years ago
viirya / SparkAffinityPropagation
Affinity Propagation on Spark
☆20 · May 31, 2021 · Updated 5 years ago
saurfang / spark-tsne
Distributed t-SNE via Apache Spark
☆158 · Dec 9, 2017 · Updated 8 years ago
tdebatty / spark-knn-graphs
Spark algorithms for building k-nn graphs
☆41 · Nov 26, 2018 · Updated 7 years ago
marufaytekin / lsh-spark
Locality Sensitive Hashing for Apache Spark
☆87 · Feb 5, 2022 · Updated 4 years ago
collectivemedia / spark-ext
Spark Extension: ML transformers, SQL aggregations, etc. that are missing in Apache Spark
☆145 · Jan 26, 2016 · Updated 10 years ago
em3s / spark-annoy
Building Annoy Index on Apache Spark
☆72 · Jan 5, 2021 · Updated 5 years ago
irvingc / dbscan-on-spark
An implementation of DBSCAN running on top of Apache Spark
☆181 · Jan 10, 2018 · Updated 8 years ago
evancasey / spark-knn-recommender
Item and User-based KNN recommendation algorithms using PySpark
☆124 · Nov 14, 2017 · Updated 8 years ago
sryza / spark-timeseries
A library for time series analysis on Apache Spark
☆1,197 · Oct 13, 2020 · Updated 5 years ago
GrowinScala / Flipper
PDF to JSON, JSON to PDF, etc.
☆12 · Apr 18, 2018 · Updated 8 years ago
sramirez / spark-infotheoretic-feature-selection
This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is base…
☆134 · May 5, 2022 · Updated 4 years ago
syoummer / SpatialSpark
Big Spatial Data Processing using Spark
☆146 · Mar 7, 2017 · Updated 9 years ago
titicaca / spark-iforest
Isolation Forest on Spark
☆237 · Oct 15, 2024 · Updated last year
databricks / spark-sklearn
(Deprecated) Scikit-learn integration package for Apache Spark
☆1,071 · Dec 3, 2019 · Updated 6 years ago
LinkedInAttic / scanns
A scalable nearest neighbor search library in Apache Spark
☆263 · Mar 29, 2019 · Updated 7 years ago
mjuez / approx-smote
Approx-SMOTE: fast SMOTE for Big Data on Apache Spark
☆18 · Apr 27, 2022 · Updated 4 years ago
TalkingData / Fregata
A lightweight, super fast, large-scale machine learning library on Spark.
☆676 · Mar 23, 2018 · Updated 8 years ago
databricks / spark-corenlp
Stanford CoreNLP wrapper for Apache Spark
☆419 · Nov 15, 2018 · Updated 7 years ago
endymecy / AlgorithmsOnSpark
Some popular algorithms (dbscan, knn, fm, etc.) on Spark
☆32 · May 29, 2018 · Updated 8 years ago
yahoo / TensorFlowOnSpark
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
☆3,845 · Jul 10, 2023 · Updated 3 years ago
alitouka / spark_dbscan
DBSCAN clustering algorithm on top of Apache Spark
☆264 · Mar 28, 2018 · Updated 8 years ago
NikhilGupta1997 / LookOut
Beyond Outlier Detection: LookOut for Pictorial Explanation
☆27 · Nov 25, 2018 · Updated 7 years ago
hsperr / first_steps_in_scala
☆14 · Aug 23, 2015 · Updated 10 years ago
banilo / nips2015
☆14 · Feb 12, 2016 · Updated 10 years ago
LIBBLE / LIBBLE-Spark
☆154 · Sep 17, 2018 · Updated 7 years ago
Clustering4Ever / Clustering4Ever
C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
☆132 · Jan 26, 2021 · Updated 5 years ago
avulanov / spark
Mirror of Apache Spark
☆10 · Aug 11, 2016 · Updated 9 years ago
databricks / tensorframes
[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
☆744 · Jul 30, 2024 · Updated last year
amplab / keystone
Simplifying robust end-to-end machine learning on Apache Spark.
☆473 · Apr 18, 2017 · Updated 9 years ago
beckgael / Mean-Shift-LSH
Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH
☆30 · May 2, 2019 · Updated 7 years ago
fabuzaid21 / yggdrasil
Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark
☆30 · May 17, 2018 · Updated 8 years ago
ActivisionGameScience / python-kafka-benchmark
☆15 · Jun 15, 2016 · Updated 10 years ago
iheartradio / asobu
Asobu (遊ぶ) Library for building distributed REST APIs for microservices based on akka cluster and play
☆12 · Oct 20, 2016 · Updated 9 years ago
linkedin / isolation-forest
A distributed Spark/Scala implementation of the isolation forest and extended isolation forest algorithms for unsupervised outlier detect…
☆260 · Jun 12, 2026 · Updated last month
twosigma / flint
A Time Series Library for Apache Spark
☆1,172 · Jul 3, 2020 · Updated 6 years ago
mrsqueeze / spark-hash
Locality Sensitive Hashing for Apache Spark
☆198 · Nov 1, 2016 · Updated 9 years ago
bellettif / sparkGeoTS
☆12 · Apr 8, 2016 · Updated 10 years ago
lensacom / sparkit-learn
PySpark + Scikit-learn = Sparkit-learn
☆1,151 · Dec 31, 2020 · Updated 5 years ago