Estimating how similar are two sets using MinHash (Jaccard similarity coefficient)
☆30Feb 4, 2013Updated 13 years ago
Alternatives and similar repositories for MinHash
Users that are interested in MinHash are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A pure python implementation of locality sensitive hashing for text documents☆87Oct 24, 2015Updated 10 years ago
- Music Recommendations with Collaborative Filtering and Cosine Distance☆39Sep 15, 2016Updated 9 years ago
- Example Python code for comparing documents using MinHash☆252Feb 11, 2019Updated 7 years ago
- Troll the NSA with red flags and free speech! Flagger is a Firefox and Chrome extension that adds words like "Taliban" and "anthrax" into…☆16Aug 21, 2021Updated 4 years ago
- Now you can play Lego 21323☆10Dec 18, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Easy, designer-friendly page creation and A/B testing tools for Django.☆34Feb 27, 2025Updated last year
- Suite of tools for game developers building on MUD☆12Mar 13, 2024Updated 2 years ago
- Custom ZipFileField for Django that auto compact file uploaded☆19Apr 13, 2026Updated last month
- Probabilistic data structures for processing very large datasets (MinHash, HyperLogLog)☆11Aug 20, 2015Updated 10 years ago
- A set of abstract and concrete models for your Wagtail website.☆12Apr 20, 2021Updated 5 years ago
- Visualizations of character embeddings from derived character vectors.☆13Apr 4, 2017Updated 9 years ago
- IPython Notebook for Sentiment Classification☆10Nov 12, 2014Updated 11 years ago
- Geopandas and Shapely☆10Jul 29, 2018Updated 7 years ago
- Plateforme d'échanges entre particuliers avec monnaie virtuelle et revenu de base☆16Aug 15, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Command line utility for d3-pre pre-rendering pipeline☆13Jul 14, 2016Updated 9 years ago
- Learning to Recommend using a Deep Reinforcement Agent☆23Apr 2, 2017Updated 9 years ago
- A ChatGPT plugin for Solana☆13Jun 1, 2023Updated 3 years ago
- Hawk HTTP Authorization for Django Rest Framework☆19Jul 28, 2020Updated 5 years ago
- Portable Linked Profiles documentation☆26Jan 6, 2016Updated 10 years ago
- Arxiv crawler written in python☆13Jun 17, 2012Updated 13 years ago
- Data Science Course Materials - Fall 2014☆12Sep 6, 2014Updated 11 years ago
- Python library to get the Alexa rank of the domain of any URL☆10Jan 28, 2013Updated 13 years ago
- An analysis of traffic accident data for the UK in 2014, using data from the UK Data Service. (Sourced from Kaggle with original data com…☆12Mar 20, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Repository for in class material for Data Bootcamp☆14May 18, 2019Updated 7 years ago
- Consists all the try outs and assignments in AIML program of great lakes☆10Jun 11, 2020Updated 5 years ago
- Simple Programmable R NSE☆14Feb 16, 2018Updated 8 years ago
- A django app that allows the easy addition of EpicEditor markdown editor to a django form field, whether in a custom app or the Django Ad…☆38Sep 2, 2014Updated 11 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆14Oct 12, 2016Updated 9 years ago
- A python script to scrape predictit data and upload it to a mysql database☆11May 29, 2017Updated 9 years ago
- A compendium of data projects and associated blog posts☆10Nov 4, 2019Updated 6 years ago
- RhymerFinder predicts the rhymes in a song based on preceding lyrics by using gensim's Word2Vec implementation.☆10Aug 15, 2017Updated 8 years ago
- sklearn implementation of gap-statistic☆10May 25, 2019Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Team project using natural language to query blockchain data☆13Apr 17, 2023Updated 3 years ago
- A mezzanine flavored fork of django-flatblocks. The goal of this project is to be able to easily create custom blocks of HTML in the temp…☆48Oct 23, 2022Updated 3 years ago
- dotfiles☆15Updated this week
- A JavaScript implementation of the Logic Programming System described in section 4.4 of "Structure and Interpretation of Computer Program…☆22Jan 25, 2012Updated 14 years ago
- PoC PredictIt auto trading bot on tweet count markets☆15Jul 12, 2017Updated 8 years ago
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Mar 22, 2023Updated 3 years ago
- Ontologie PAIR: Projets, Acteurs, Idées, Ressources☆23Jan 8, 2026Updated 5 months ago