LSH based high dimensional clustering for sets and points
☆80Nov 15, 2014Updated 11 years ago
Alternatives and similar repositories for lshhdc
Users that are interested in lshhdc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A pure python implementation of locality sensitive hashing for text documents☆87Oct 24, 2015Updated 10 years ago
- A fast Python implementation of locality sensitive hashing.☆71Mar 13, 2015Updated 11 years ago
- Example Python code for comparing documents using MinHash☆251Feb 11, 2019Updated 7 years ago
- ☆32Nov 15, 2017Updated 8 years ago
- A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.☆150Sep 4, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆27Dec 1, 2015Updated 10 years ago
- Python framework for fast (approximated) nearest neighbour search in large, high-dimensional data sets using different locality-sensitive…☆772Feb 23, 2023Updated 3 years ago
- A small utility for converting Stanford GloVe vectors to HDF5 / NumPy☆12Apr 4, 2017Updated 9 years ago
- My fork of zerofrog's fast SIFT C++ reimplementation of Bill Lowe's original smash-hit image-analysis algorithm.☆21Sep 19, 2012Updated 13 years ago
- TF-IDF with Spark for the Kaggle popcorn competition☆10Jul 1, 2015Updated 10 years ago
- ☆11May 15, 2017Updated 8 years ago
- ☆12Jun 17, 2019Updated 6 years ago
- MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW☆2,904Updated this week
- Tensorflow implementation of a supervised approach to learn highly compressed image representations☆26Nov 22, 2017Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆16May 31, 2017Updated 8 years ago
- TBEEF, a doubly ensemble framework for recommendation and prediction problems.☆20Apr 16, 2016Updated 10 years ago
- RNA-Skim: a rapid method for RNA-Seq quantification at transcript level☆19Sep 3, 2017Updated 8 years ago
- Gibbs sampling inference to LDA☆19Apr 4, 2014Updated 12 years ago
- Given the Live on board data of various drivers, a score corresponding to each driver is to be formulated, which will help insurance comp…☆12Sep 13, 2018Updated 7 years ago
- A project to demonstrate maximum entropy models for extracting quotes from news articles in Python.☆26Aug 27, 2012Updated 13 years ago
- solution for the 5th place of cikm cup 2014☆19Jan 28, 2015Updated 11 years ago
- A lightweight command line interface for the management of arbitrary machine learning tasks☆19Jan 29, 2021Updated 5 years ago
- Implementation of a pairwise document similarity algorithm using MapReduce.☆15Nov 16, 2011Updated 14 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for our DLS'21 paper - BODMAS: An Open Dataset for Learning based Temporal Analysis of PE Malware. BODMAS is short for Blue Hexagon …☆92Mar 31, 2024Updated 2 years ago
- This repository includes all the code and data for the paper ELiDi (End2end Entity Linking and Disambiguation)☆14Jul 18, 2021Updated 4 years ago
- Code for the paper "Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning"☆11May 5, 2021Updated 4 years ago
- This is a repository in which we take part in the big data competition, focusing on recommendation system.☆17May 24, 2016Updated 9 years ago
- Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit"☆111Aug 8, 2014Updated 11 years ago
- Empirical tests of various bandit algorithms.☆16Dec 6, 2014Updated 11 years ago
- Learning word embeddings with AdaGrad and Noise Contrastive Estimation, C++ 11.☆13Sep 22, 2014Updated 11 years ago
- ☆12Apr 17, 2021Updated 5 years ago
- Probabilistic data structures for processing very large datasets (MinHash, HyperLogLog)☆11Aug 20, 2015Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- brew for Windows (legacy)☆33Aug 17, 2013Updated 12 years ago
- Event Time Extraction with a Decision Tree of Neural Classifiers☆18Feb 28, 2019Updated 7 years ago
- Locality-Sensitive Hashing for Minhash Signatures☆12Sep 12, 2013Updated 12 years ago
- ☆15Feb 19, 2016Updated 10 years ago
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation☆16Oct 14, 2022Updated 3 years ago
- Python Approximate Nearest Neighbor Search in very high dimensional spaces with optimised indexing.☆219Oct 7, 2021Updated 4 years ago
- ☆13Mar 1, 2019Updated 7 years ago