mrsqueeze/spark-hash

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mrsqueeze/spark-hash)

mrsqueeze / spark-hash

Locality Sensitive Hashing for Apache Spark

☆198

Alternatives and similar repositories for spark-hash

Users that are interested in spark-hash are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

marufaytekin / lsh-spark
View on GitHub
Locality Sensitive Hashing for Apache Spark
☆87Feb 5, 2022Updated 4 years ago
karlhigley / spark-neighbors
View on GitHub
Spark-based approximate nearest neighbor search using locality-sensitive hashing
☆104Jul 5, 2016Updated 10 years ago
tmpsrcrepo / benchmark_minhash_lsh
View on GitHub
insight data engineering fellow project
☆16Nov 14, 2016Updated 9 years ago
barneygovan / lsh-scala
View on GitHub
A Locality-Sensitive Hashing Library for Scala with optional Redis storage.
☆17Jan 5, 2022Updated 4 years ago
magsol / pyspark-lsh
View on GitHub
Locality-sensitive hashing in PySpark.
☆27Mar 11, 2015Updated 11 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
collectivemedia / spark-hyperloglog
View on GitHub
Interactive Audience Analytics with Spark and HyperLogLog
☆55Oct 14, 2015Updated 10 years ago
amplab / spark-indexedrdd
View on GitHub
An efficient updatable key-value store for Apache Spark
☆255Mar 11, 2017Updated 9 years ago
AtlasPilotPuppy / SparkAlgorithms
View on GitHub
Additional useful algorithms that can be used with spark.
☆24Dec 24, 2014Updated 11 years ago
brkyvz / lazy-linalg
View on GitHub
A package full of linear algebra operators for Apache Spark MLlib's linalg package
☆10Sep 9, 2015Updated 10 years ago
phatak-dev / anatomy-of-rdd
View on GitHub
☆21Mar 27, 2015Updated 11 years ago
avulanov / ann-benchmark
View on GitHub
Benchmarks of artificial neural network library for Spark MLlib
☆11Dec 3, 2015Updated 10 years ago
saurfang / spark-tsne
View on GitHub
Distributed t-SNE via Apache Spark
☆158Dec 9, 2017Updated 8 years ago
sryza / spark-timeseries
View on GitHub
A library for time series analysis on Apache Spark
☆1,197Oct 13, 2020Updated 5 years ago
databricks / tensorframes
View on GitHub
[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
☆744Jul 30, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
rjagerman / glint
View on GitHub
Glint: High performance scala parameter server
☆170Jul 20, 2018Updated 8 years ago
entropyltd / spark-cloud
View on GitHub
Spark-cloud is a set of scripts for starting spark clusters on ec2
☆12Dec 21, 2015Updated 10 years ago
brkyvz / streaming-matrix-factorization
View on GitHub
Distributed Streaming Matrix Factorization implemented on Spark for Recommendation Systems
☆109Mar 25, 2016Updated 10 years ago
hammerlab / spree
View on GitHub
Live-updating Spark UI built with Meteor
☆190Apr 6, 2021Updated 5 years ago
beckgael / Mean-Shift-LSH
View on GitHub
Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH
☆30May 2, 2019Updated 7 years ago
evancasey / spark-knn-recommender
View on GitHub
Item and User-based KNN recommendation algorithms using PySpark
☆124Nov 14, 2017Updated 8 years ago
tresata / spark-kafka
View on GitHub
Low level integration of Spark and Kafka
☆129Mar 15, 2018Updated 8 years ago
apache / incubator-toree
View on GitHub
Mirror of Apache Toree (Incubating)
☆750Updated this week
LanceNorskog / LSH-Hadoop
View on GitHub
Implementation of Tyler Neylon's Locality-Specific Hash based on simplex tesselations
☆28Oct 15, 2011Updated 14 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
databricks / spark-redshift
View on GitHub
Redshift data source for Apache Spark
☆608Aug 10, 2023Updated 2 years ago
tresata / spark-columnar
View on GitHub
☆15Mar 4, 2015Updated 11 years ago
malcolmgreaves / data-tc
View on GitHub
A type class for data of all sizes.
☆15Jul 9, 2019Updated 7 years ago
databricks / spark-tfocs
View on GitHub
A Spark port of TFOCS: Templates for First-Order Conic Solvers (cvxr.com/tfocs)
☆90Apr 15, 2024Updated 2 years ago
tresata / spark-scalding
View on GitHub
Use Cascading Taps and Scalding DSL with Spark
☆49Dec 28, 2016Updated 9 years ago
SHSE / spark-es
View on GitHub
ElasticSearch integration for Apache Spark
☆47Apr 5, 2016Updated 10 years ago
ceteri / spark-exercises
View on GitHub
Coding exercises for Apache Spark
☆103Jun 4, 2015Updated 11 years ago
flaxsearch / lucene-solr-intervals
View on GitHub
Flax-maintained fork of Lucene/Solr with support for interval queries
☆15Oct 9, 2015Updated 10 years ago
spark-notebook / spark-notebook
View on GitHub
Interactive and Reactive Data Science using Scala and Spark.
☆3,142May 16, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
collectivemedia / spark-ext
View on GitHub
Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark
☆145Jan 26, 2016Updated 10 years ago
amplab / SparkNet
View on GitHub
Distributed Neural Networks for Spark
☆610Jul 23, 2020Updated 6 years ago
takahi-i / likelike
View on GitHub
An implementation of locality sensitive hashing with Hadoop
☆58Feb 5, 2015Updated 11 years ago
cne1x / sfcs
View on GitHub
Space-Filling Curves in Scala
☆26Aug 25, 2020Updated 5 years ago
adobe-research / spindle
View on GitHub
Next-generation web analytics processing with Scala, Spark, and Parquet.
☆330Mar 28, 2015Updated 11 years ago
dask / knit
View on GitHub
Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead
☆54Jul 3, 2018Updated 8 years ago
spark-jobserver / spark-jobserver
View on GitHub
REST job server for Apache Spark
☆2,836Mar 3, 2026Updated 4 months ago