rahularora / MinHash
Estimating how similar two sets are using MinHash (an approximation of the Jaccard similarity coefficient)
☆31 Updated 12 years ago
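For readers unfamiliar with the technique, the following is a minimal sketch of MinHash, not code taken from this repository. It assumes seeded BLAKE2b hashes as a stand-in for independent hash functions; the names `jaccard`, `minhash_signature`, `estimate_similarity`, and the `num_hashes` parameter are hypothetical and chosen for illustration only. The idea is that the fraction of positions where two signatures agree approximates the Jaccard coefficient of the underlying sets.

```python
import hashlib


def jaccard(a, b):
    """Exact Jaccard similarity: |A ∩ B| / |A ∪ B|."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # convention: two empty sets are treated as identical
    return len(a & b) / len(a | b)


def _seeded_hash(item, seed):
    """64-bit hash of an item under a given seed (illustrative choice of hash)."""
    data = f"{seed}:{item}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(items, num_hashes=256):
    """For each seeded hash function, keep the minimum hash value over the set."""
    items = set(items)
    if not items:
        raise ValueError("cannot build a signature for an empty set")
    return [min(_seeded_hash(x, seed) for x in items) for seed in range(num_hashes)]


def estimate_similarity(sig_a, sig_b):
    """Fraction of signature positions that agree; estimates the Jaccard coefficient."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)


if __name__ == "__main__":
    a = set(range(0, 100))
    b = set(range(50, 150))
    print("exact:   ", jaccard(a, b))
    print("estimate:", estimate_similarity(minhash_signature(a), minhash_signature(b)))
```

More hash functions reduce the variance of the estimate at the cost of longer signatures; the value 256 here is an arbitrary default for the sketch.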
Alternatives and similar repositories for MinHash
Users that are interested in MinHash are comparing it to the libraries listed below
- LSH based high dimensional clustering for sets and points ☆79 Updated 10 years ago
- A fast Python implementation of locality sensitive hashing. ☆70 Updated 10 years ago
- A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets. ☆147 Updated 10 months ago
- Example Python code for comparing documents using MinHash ☆251 Updated 6 years ago
- CrowdRec reference framework ☆32 Updated 8 years ago
- lightweight python wrapper for vowpal wabbit ☆168 Updated 5 years ago
- A pure python implementation of locality sensitive hashing for text documents ☆85 Updated 9 years ago
- POC IDS anomaly detection engine built with iPython notebook, matplotlib, pandas, numpy, scikit-learn, d3.js, hyperloglog implementation,… ☆79 Updated 11 years ago
- It is a forest of random projection trees ☆223 Updated 5 years ago
- Recommender System Framework ☆125 Updated 8 years ago
- Wabbit Wappa is a full-featured Python wrapper for the Vowpal Wabbit machine learning utility. ☆101 Updated 7 years ago
- FluRS: A Python library for streaming recommendation algorithms ☆109 Updated 3 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013) ☆55 Updated 3 years ago
- A few data mining algorithms in pure python ☆469 Updated 9 years ago
- Topological Anomaly Detection (TAD) per Gartley and Basener 2009 ☆69 Updated 5 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing ☆206 Updated 2 years ago
- Solution to Facebook's link prediction contest on Kaggle. ☆205 Updated 13 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig… ☆99 Updated 10 years ago
- locality sensitive hashing ☆71 Updated 13 years ago
- RiVal recommender system evaluation toolkit ☆150 Updated 6 years ago
- Word2Vec models with Twitter data using Spark. Blog: ☆65 Updated 6 years ago
- A new version of phraug, which is a set of simple Python scripts for pre-processing large files ☆206 Updated 7 years ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries. ☆65 Updated 8 years ago
- Additive Groves, Bagged Trees with Feature Evaluation, Interaction Detection, Visualization of Feature Effects. ☆65 Updated 4 years ago
- feng - feature engineering for machine-learning champions ☆27 Updated 8 years ago
- Community Detection Research Effort ☆79 Updated 9 years ago
- Compute and plot NDCG for a recommender system ☆95 Updated 7 years ago
- How to use automatic polynomial features and neural network mode in VW ☆17 Updated 11 years ago
- Set of Machine Learning and Stochastic Optimization tools based on Hadoop, Spark and Storm https://pkghosh.wordpress.com/ ☆177 Updated last year
- Natural Language Processing with Spark's MLlib ☆62 Updated 7 years ago