karlhigley / spark-neighborsView external linksLinks
Spark-based approximate nearest neighbor search using locality-sensitive hashing
☆104Jul 5, 2016Updated 9 years ago
Alternatives and similar repositories for spark-neighbors
Users that are interested in spark-neighbors are comparing it to the libraries listed below
Sorting:
- Locality Sensitive Hashing for Apache Spark☆87Feb 5, 2022Updated 4 years ago
- Locality Sensitive Hashing for Apache Spark☆196Nov 1, 2016Updated 9 years ago
- Spark algorithms for building k-nn graphs☆41Nov 26, 2018Updated 7 years ago
- Big Spatial Data Processing using Spark☆146Mar 7, 2017Updated 8 years ago
- NCG acceleration of ALS computing low rank matrix factorizations for Collaborative Filtering☆14Feb 15, 2016Updated 10 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- Locality-sensitive hashing in PySpark.☆27Mar 11, 2015Updated 10 years ago
- Scalable Distributed LDA implementation for Spark & Glint☆29Sep 27, 2016Updated 9 years ago
- A scalable nearest neighbor search library in Apache Spark☆262Mar 29, 2019Updated 6 years ago
- A SQL-esque scripting language for spatial processing and ETL☆11Mar 4, 2019Updated 6 years ago
- Factorization Machines on Spark and Glint☆25Nov 7, 2016Updated 9 years ago
- Use AlluxioBlockManager to intead TachyonBlockManager as spark's off_heap.☆14Nov 3, 2016Updated 9 years ago
- Scripts used to setup a Spark cluster on EC2☆21Mar 24, 2016Updated 9 years ago
- Algorithms from the book "Elements of Statistical Learning", implemented in Python☆12Mar 29, 2015Updated 10 years ago
- Data science repo to help others☆12Feb 10, 2016Updated 10 years ago
- Tutorial on Web Table Extraction, Retrieval and Augmentation☆11Mar 28, 2020Updated 5 years ago
- DBSCAN algorithm implemented in TensorFlow.☆10Apr 17, 2019Updated 6 years ago
- Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi…☆170Nov 17, 2018Updated 7 years ago
- A library for parsing and querying an Esri File Geodatabase with Apache Spark.☆26Nov 13, 2016Updated 9 years ago
- API and libraries for generating travelsheds from OSM & GTFS data☆40Jul 14, 2018Updated 7 years ago
- Theano implementation of GloVe for graphs☆47Jul 5, 2015Updated 10 years ago
- RankNet, LambdaRank, LambdaMART, GBrank☆14Nov 16, 2013Updated 12 years ago
- insight data engineering fellow project☆16Nov 14, 2016Updated 9 years ago
- Notes on Lambda Architecture☆12Feb 9, 2018Updated 8 years ago
- ☆13Nov 2, 2017Updated 8 years ago
- Glint: High performance scala parameter server☆170Jul 20, 2018Updated 7 years ago
- Symmetrized word alignment models, based on mgizapp and GIZA++☆14Jun 23, 2014Updated 11 years ago
- Use word2vec embedding with LSTM for the "Bag of Words meets Bag of Popcorn" challenge☆16May 12, 2017Updated 8 years ago
- Benchmarks of BLAS libraries with Scala interface☆30Jan 21, 2016Updated 10 years ago
- Global Vectors for Word Representation on spark☆35Oct 30, 2014Updated 11 years ago
- Approximate nearest neighbors in Java☆144Oct 13, 2020Updated 5 years ago
- SBT plugin for running FindBugs on Java classes☆14Sep 26, 2020Updated 5 years ago
- This is a minimal acyclic finite-state automata algorithm in Java based on the paper, "Incremental Construction of Minimal Acyclic Finite…☆20Dec 31, 2013Updated 12 years ago
- A Locality-Sensitive Hashing Library for Scala with optional Redis storage.☆16Jan 5, 2022Updated 4 years ago
- benchmarking positional population count☆17Oct 19, 2025Updated 3 months ago
- A Distributed Matrix Operations Library Built on Top of Spark☆109Dec 28, 2016Updated 9 years ago
- Reactive Factorization Engine☆104Feb 18, 2015Updated 10 years ago
- An implement of Factorization Machines (LibFM)☆250Aug 13, 2018Updated 7 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆476Apr 18, 2017Updated 8 years ago