Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.
☆30Jul 17, 2015Updated 10 years ago
Alternatives and similar repositories for hyperplane-hasher
Users that are interested in hyperplane-hasher are comparing it to the libraries listed below
Sorting:
- Implement Natural Language Object Retrieval in tensorflow☆35Nov 30, 2016Updated 9 years ago
- Dependency and data pipeline management framework for Spark and Scala☆15Apr 8, 2017Updated 8 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Sep 30, 2015Updated 10 years ago
- Jeremy's Machine Learning Library☆32Aug 19, 2017Updated 8 years ago
- Price options by fitting a Lévy distribution☆10Jan 20, 2021Updated 5 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Jun 1, 2015Updated 10 years ago
- Distributed Web Crawler, Parser and Search Engine.☆10Jun 16, 2016Updated 9 years ago
- USAAR participation in SemEval2015☆11Dec 21, 2022Updated 3 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Feb 27, 2014Updated 12 years ago
- A Cython interface to FLANN☆24Nov 25, 2020Updated 5 years ago
- This repo contain the exercies of the Next.ML 2015 presentation☆24Jan 17, 2015Updated 11 years ago
- framework for doing NER and other types of entity recognition, in Python☆68Jun 21, 2022Updated 3 years ago
- A platform for collecting, analyzing, and visualizing social media data.☆13Dec 27, 2020Updated 5 years ago
- Data science repo to help others☆12Feb 10, 2016Updated 10 years ago
- Sparse Beta-Divergence Tensor Factorization Library☆48Jun 2, 2025Updated 8 months ago
- Instructions for deploying Kubeflow on EKS and minikube☆15Jun 25, 2021Updated 4 years ago
- An autoencoder to calculate word embeddings as mentioned in Lebret/Collobert paper 2015☆74Jan 19, 2017Updated 9 years ago
- A streaming cross-cat inference engine☆49Dec 19, 2014Updated 11 years ago
- provides iSAX Java implementation☆14Jun 13, 2015Updated 10 years ago
- Successor to Annoy https://github.com/spotify/annoy☆13Oct 28, 2015Updated 10 years ago
- ☆26Feb 12, 2017Updated 9 years ago
- Using social media to steer web archiving and curation.☆18Nov 20, 2015Updated 10 years ago
- Standalone Semanticizer☆32Mar 4, 2015Updated 10 years ago
- Omnivore Optimizer and Distributed CcT☆13Jun 17, 2016Updated 9 years ago
- Fast structured perceptron sequential labeler☆15Dec 8, 2015Updated 10 years ago
- Links parts of input text to Wikipedia articles☆16Sep 9, 2012Updated 13 years ago
- ☆15Dec 14, 2020Updated 5 years ago
- Faster, simpler Django content management☆36Aug 24, 2018Updated 7 years ago
- Implementation of Bayesian Sets for fast similarity searches.☆14Oct 2, 2011Updated 14 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Aug 8, 2016Updated 9 years ago
- Python code for training Paragram word embeddings. These achieve human-level performance on some word similiarty tasks including SimLex-9…☆30Feb 4, 2016Updated 10 years ago
- It is a forest of random projection trees☆225Feb 8, 2020Updated 6 years ago
- Content-based Recommendation Generator☆13Jan 21, 2015Updated 11 years ago
- TREC Real-Time Summarization Tools☆15Jul 19, 2017Updated 8 years ago
- ☆11Sep 1, 2016Updated 9 years ago
- Rectified Factor Networks☆37Oct 16, 2019Updated 6 years ago
- The useful and used parts of NN-Dropout☆25Jun 4, 2015Updated 10 years ago
- Simple ranking metrics for PyTorch on CPU or GPU☆15Nov 20, 2020Updated 5 years ago
- Implementation of Frequent-Directions algorithm for efficient matrix sketching [E. Liberty, SIGKDD2013]☆27Apr 19, 2015Updated 10 years ago