Estimating how similar two sets are using MinHash (Jaccard similarity coefficient)
☆30 · Feb 4, 2013 · Updated 13 years ago
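As a rough illustration of the idea named in the description above, the following Python sketch estimates the Jaccard similarity of two sets from MinHash signatures. It is not taken from the repository: the function names (`jaccard`, `minhash_signature`, `estimate_jaccard`), the use of salted BLAKE2b as the hash family, and the default of 256 hash functions are all assumptions made for this example.

```python
import hashlib


def jaccard(a, b):
    """Exact Jaccard similarity |A & B| / |A | B| of two collections."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # convention chosen here: two empty sets count as identical
    return len(a & b) / len(a | b)


def _seeded_hash(item, seed):
    """64-bit hash of `item`; each seed acts as a different hash function.

    Salted BLAKE2b is an assumption of this sketch, not the repository's choice.
    """
    digest = hashlib.blake2b(
        str(item).encode("utf-8"),
        digest_size=8,
        salt=seed.to_bytes(8, "little"),
    ).digest()
    return int.from_bytes(digest, "little")


def minhash_signature(items, num_hashes=256):
    """Return the MinHash signature: the minimum hash value per hash function."""
    items = set(items)
    if not items:
        raise ValueError("cannot build a signature for an empty set")
    return [min(_seeded_hash(x, seed) for x in items) for seed in range(num_hashes)]


def estimate_jaccard(sig_a, sig_b):
    """Estimate Jaccard similarity as the fraction of matching signature slots."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)


if __name__ == "__main__":
    a = set(range(0, 100))
    b = set(range(50, 150))
    print("exact:    ", jaccard(a, b))
    print("estimated:", estimate_jaccard(minhash_signature(a), minhash_signature(b)))
```

For an idealized random hash function, the probability that two sets share the same minimum hash value equals their Jaccard coefficient, so the fraction of matching slots estimates it. The standard error shrinks roughly as one over the square root of `num_hashes`.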
Alternatives and similar repositories for MinHash
Users that are interested in MinHash are comparing it to the libraries listed below.
- LSH based high dimensional clustering for sets and points ☆80 · Nov 15, 2014 · Updated 11 years ago
- A pure python implementation of locality sensitive hashing for text documents ☆87 · Oct 24, 2015 · Updated 10 years ago
- Music Recommendations with Collaborative Filtering and Cosine Distance ☆39 · Sep 15, 2016 · Updated 9 years ago
- a testimonials app for Django ☆27 · Jun 19, 2021 · Updated 4 years ago
- facebook link prediction kaggle challenge. ☆15 · Aug 10, 2014 · Updated 11 years ago
- Example Python code for comparing documents using MinHash ☆251 · Feb 11, 2019 · Updated 7 years ago
- Now you can play Lego 21323 ☆10 · Dec 18, 2020 · Updated 5 years ago
- Suite of tools for game developers building on MUD ☆12 · Mar 13, 2024 · Updated 2 years ago
- Custom ZipFileField for Django that auto compact file uploaded ☆19 · Apr 13, 2026 · Updated 2 weeks ago
- ctypes bindings for libphash to robustly compare media files ☆12 · Dec 28, 2022 · Updated 3 years ago
- Probabilistic data structures for processing very large datasets (MinHash, HyperLogLog) ☆11 · Aug 20, 2015 · Updated 10 years ago
- Django feeds provides an extensive database model for RSS feeds and a fault tolerant parser. ☆30 · Jun 14, 2012 · Updated 13 years ago
- A set of abstract and concrete models for your Wagtail website. ☆12 · Apr 20, 2021 · Updated 5 years ago
- Repo for practical data science problems approaches, including notebook demo and working scripts | #DS | #analysis ☆12 · Oct 13, 2020 · Updated 5 years ago
- IPython Notebook for Sentiment Classification ☆10 · Nov 12, 2014 · Updated 11 years ago
- Geopandas and Shapely ☆10 · Jul 29, 2018 · Updated 7 years ago
- Django-docopt-command allows you to write Django manage.py commands using the docopt library ☆26 · May 21, 2024 · Updated last year
- ☆16 · Sep 19, 2017 · Updated 8 years ago
- Sandbox of Spark-notebook ☆10 · Mar 19, 2016 · Updated 10 years ago
- keyboard_shortcuts roundcube plugin ☆26 · Mar 9, 2023 · Updated 3 years ago
- A ChatGPT plugin for Solana ☆13 · Jun 1, 2023 · Updated 2 years ago
- ☆14 · Aug 22, 2025 · Updated 8 months ago
- Network timing evaluation used to detect beacons, works with argus flow as the source ☆20 · May 4, 2016 · Updated 9 years ago
- Hawk HTTP Authorization for Django Rest Framework ☆19 · Jul 28, 2020 · Updated 5 years ago
- Portable Linked Profiles documentation ☆26 · Jan 6, 2016 · Updated 10 years ago
- SQLite handler for python logging. Allows writing log messages to an SQLite database. Tested on python 3.2 ☆16 · Jun 22, 2012 · Updated 13 years ago
- Arxiv crawler written in python ☆13 · Jun 17, 2012 · Updated 13 years ago
- Python library to get the Alexa rank of the domain of any URL ☆10 · Jan 28, 2013 · Updated 13 years ago
- TopK Algorithms Benchmark ☆10 · Jul 16, 2019 · Updated 6 years ago
- Hidden alignment conditional random field for classifying string pairs. ☆24 · Jan 12, 2026 · Updated 3 months ago
- PoC for Validating Keycloak Configurations with Open Policy Agent Policies ☆11 · Sep 13, 2022 · Updated 3 years ago
- A tutorial on entity resolution (record linkage or de-duplication) ☆65 · Jun 30, 2020 · Updated 5 years ago
- An analysis of traffic accident data for the UK in 2014, using data from the UK Data Service. (Sourced from Kaggle with original data com… ☆12 · Mar 20, 2018 · Updated 8 years ago
- Consists of all the tryouts and assignments in the AIML program of Great Lakes ☆10 · Jun 11, 2020 · Updated 5 years ago
- Finding Overlapping Communities in Social Networks ☆10 · Feb 12, 2014 · Updated 12 years ago
- Test how readable the content you enter into wagtail is. ☆16 · Mar 8, 2016 · Updated 10 years ago
- Locality-Sensitive Hashing for Minhash Signatures ☆12 · Sep 12, 2013 · Updated 12 years ago
- A django app that allows the easy addition of EpicEditor markdown editor to a django form field, whether in a custom app or the Django Ad… ☆38 · Sep 2, 2014 · Updated 11 years ago
- A python script to scrape predictit data and upload it to a mysql database ☆12 · May 29, 2017 · Updated 8 years ago