Estimating how similar are two sets using MinHash (Jaccard similarity coefficient)
☆30Feb 4, 2013Updated 13 years ago
Alternatives and similar repositories for MinHash
Users that are interested in MinHash are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LSH based high dimensional clustering for sets and points☆80Nov 15, 2014Updated 11 years ago
- A pure python implementation of locality sensitive hashing for text documents☆87Oct 24, 2015Updated 10 years ago
- a testimonials app for Django☆27Jun 19, 2021Updated 5 years ago
- facebook link prediction kaggle challenge.☆15Aug 10, 2014Updated 11 years ago
- This is a Spark implementation (Python API) for Distributed Stochastic Gradient Descent based on☆12Apr 18, 2015Updated 11 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Example Python code for comparing documents using MinHash☆252Feb 11, 2019Updated 7 years ago
- WARNING: This repo isn't maintained anymore! Drop-in CSS enhancements for Wagtail's Streamfield☆11Feb 27, 2018Updated 8 years ago
- Django feeds provides an extensive database model for RSS feeds and a fault tolerant parser.☆30Jun 14, 2012Updated 14 years ago
- A streaming algorithm for graph clustering☆12Dec 6, 2017Updated 8 years ago
- Visualizations of character embeddings from derived character vectors.☆13Apr 4, 2017Updated 9 years ago
- A copy of the source for Grinstead and Snell's lovely probability book☆13Dec 20, 2015Updated 10 years ago
- Repo for practical data science problems approaches, including notebook demo and working scripts | #DS | #analysis☆12Oct 13, 2020Updated 5 years ago
- ☆10Jan 14, 2015Updated 11 years ago
- IPython Notebook for Sentiment Classification☆10Nov 12, 2014Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Django-docopt-command allows you to write Django manage.py commands using the docopt library☆26May 21, 2024Updated 2 years ago
- ☆16Sep 19, 2017Updated 8 years ago
- Plateforme d'échanges entre particuliers avec monnaie virtuelle et revenu de base☆16Aug 15, 2017Updated 8 years ago
- keyboard_shortcuts roundcube plugin☆26Mar 9, 2023Updated 3 years ago
- Command line utility for d3-pre pre-rendering pipeline☆13Jul 14, 2016Updated 9 years ago
- Learning to Recommend using a Deep Reinforcement Agent☆23Apr 2, 2017Updated 9 years ago
- ☆14May 21, 2026Updated last month
- ☆10Dec 12, 2023Updated 2 years ago
- An rope jumping application on Android and Apple Watch☆12Sep 7, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Entitypedia is an Extended Named Entity Dictionary from Wikipedia.☆13Dec 7, 2022Updated 3 years ago
- Network timing evaluation used to detect beacons, works with argus flow as the source☆20May 4, 2016Updated 10 years ago
- Hawk HTTP Authorization for Django Rest Framework☆19Jul 28, 2020Updated 5 years ago
- SQLite handler for python logging. Allow to write log messages to sqlite database. Tested on python 3.2☆16Jun 22, 2012Updated 14 years ago
- Python library to get the Alexa rank of the domain of any URL☆10Jan 28, 2013Updated 13 years ago
- ☆12Mar 28, 2023Updated 3 years ago
- Hands-On Machine Learning Using Amazon SageMaker [video], published by Packt☆16Dec 8, 2022Updated 3 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Jan 12, 2026Updated 5 months ago
- community detection for the whole Twitter graph on a single laptop☆21Nov 21, 2017Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- An analysis of traffic accident data for the UK in 2014, using data from the UK Data Service. (Sourced from Kaggle with original data com…☆12Mar 20, 2018Updated 8 years ago
- ☆12May 27, 2026Updated last month
- Repository for in class material for Data Bootcamp☆14May 18, 2019Updated 7 years ago
- Consists all the try outs and assignments in AIML program of great lakes☆10Jun 11, 2020Updated 6 years ago
- Finding Overlapping Communities in Social Networks☆10Feb 12, 2014Updated 12 years ago
- ☆12Mar 1, 2025Updated last year
- Jupyter Notebooks for Bussiness2Vector☆13Jun 28, 2018Updated 8 years ago