Locality Sensitive Hashing for Apache Spark
☆198 · Nov 1, 2016 · Updated 9 years ago
Alternatives and similar repositories for spark-hash
Users who are interested in spark-hash are comparing it to the libraries listed below.
- Locality Sensitive Hashing for Apache Spark ☆87 · Feb 5, 2022 · Updated 4 years ago
- Spark-based approximate nearest neighbor search using locality-sensitive hashing ☆105 · Jul 5, 2016 · Updated 9 years ago
- Insight Data Engineering fellow project ☆16 · Nov 14, 2016 · Updated 9 years ago
- A Scala library for locality sensitive hashing ☆14 · Aug 1, 2018 · Updated 7 years ago
- A Locality-Sensitive Hashing Library for Scala with optional Redis storage. ☆16 · Jan 5, 2022 · Updated 4 years ago
- Locality-sensitive hashing in PySpark. ☆27 · Mar 11, 2015 · Updated 11 years ago
- Interactive Audience Analytics with Spark and HyperLogLog ☆55 · Oct 14, 2015 · Updated 10 years ago
- Additional useful algorithms that can be used with Spark. ☆24 · Dec 24, 2014 · Updated 11 years ago
- An efficient updatable key-value store for Apache Spark ☆255 · Mar 11, 2017 · Updated 9 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package ☆10 · Sep 9, 2015 · Updated 10 years ago
- Benchmarks of artificial neural network library for Spark MLlib ☆11 · Dec 3, 2015 · Updated 10 years ago
- ☆21 · Mar 27, 2015 · Updated 11 years ago
- Distributed t-SNE via Apache Spark ☆159 · Dec 9, 2017 · Updated 8 years ago
- A library for time series analysis on Apache Spark ☆1,199 · Oct 13, 2020 · Updated 5 years ago
- Glint: High performance Scala parameter server ☆170 · Jul 20, 2018 · Updated 7 years ago
- Implementation of Tyler Neylon's Locality-Specific Hash based on simplex tessellations ☆28 · Oct 15, 2011 · Updated 14 years ago
- Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH ☆30 · May 2, 2019 · Updated 7 years ago
- Distributed Streaming Matrix Factorization implemented on Spark for Recommendation Systems ☆109 · Mar 25, 2016 · Updated 10 years ago
- Live-updating Spark UI built with Meteor ☆190 · Apr 6, 2021 · Updated 5 years ago
- Mirror of Apache Toree (Incubating) ☆750 · Apr 2, 2026 · Updated last month
- Item and User-based KNN recommendation algorithms using PySpark ☆124 · Nov 14, 2017 · Updated 8 years ago
- Low level integration of Spark and Kafka ☆131 · Mar 15, 2018 · Updated 8 years ago
- Clustering documents based on LSH ☆14 · Apr 20, 2016 · Updated 10 years ago
- A type class for data of all sizes. ☆15 · Jul 9, 2019 · Updated 6 years ago
- ElasticSearch integration for Apache Spark ☆47 · Apr 5, 2016 · Updated 10 years ago
- Use Cascading Taps and Scalding DSL with Spark ☆49 · Dec 28, 2016 · Updated 9 years ago
- Coding exercises for Apache Spark ☆104 · Jun 4, 2015 · Updated 10 years ago
- Spark Extension: ML transformers, SQL aggregations, etc. that are missing in Apache Spark ☆146 · Jan 26, 2016 · Updated 10 years ago
- Interactive and Reactive Data Science using Scala and Spark. ☆3,150 · May 16, 2023 · Updated 2 years ago
- Distributed Neural Networks for Spark ☆611 · Jul 23, 2020 · Updated 5 years ago
- An implementation of locality sensitive hashing with Hadoop ☆58 · Feb 5, 2015 · Updated 11 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead ☆54 · Jul 3, 2018 · Updated 7 years ago
- Space-Filling Curves in Scala ☆26 · Aug 25, 2020 · Updated 5 years ago
- ☆13 · Nov 2, 2015 · Updated 10 years ago
- REST job server for Apache Spark ☆2,845 · Mar 3, 2026 · Updated 2 months ago
- Topic Modeling with LDA in Scala and Spark ☆31 · Sep 25, 2018 · Updated 7 years ago
- Scala port of the word2vec toolkit. ☆47 · Aug 20, 2014 · Updated 11 years ago
- Gaussian Mixture Model Implementation in PySpark ☆31 · Dec 2, 2014 · Updated 11 years ago
- Secondary sort and streaming reduce for Apache Spark ☆78 · Jul 3, 2023 · Updated 2 years ago