loretoparisi / lshash
locality sensitive hashing (LSHASH) for Python3
☆63Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for lshash
- Clustering for arbitrary data and dissimilarity function☆87Updated 5 months ago
- Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner P…☆55Updated 3 years ago
- KDD21 Deep Learning Embeddings for Data Series Similarity Search☆19Updated 3 years ago
- Neural LSH [ICLR 2020] - Using supervised learning to produce better space partitions for fast nearest neighbor search.☆71Updated 3 years ago
- Extremely simple and fast extreme multi-class and multi-label classifiers.☆64Updated 2 months ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆16Updated 2 years ago
- Recommendation algorithms for large graphs☆29Updated 5 months ago
- string embed for fast edit distance computation, codes for [Convolutional Embedding for Edit Distance (SIGIR 20)].☆60Updated last year
- hnsw implemented by python☆62Updated 5 years ago
- Introduction Notebook to Extreme Multi-Label Classification problem (XML)☆23Updated 6 years ago
- Fast C++ implementation of https://github.com/yahoo/lopq: Locally Optimized Product Quantization (LOPQ) model and searcher for approximat…☆34Updated 4 years ago
- Locality Sensitive Hashing, fuzzy-hash, min-hash, simhash, aHash, pHash, dHash。基于 Hash值的图片相似度、文本相似度☆53Updated 10 months ago
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding☆57Updated 3 years ago
- Code for ICML2019 paper: Learning to Route in Similarity Graphs☆57Updated 3 months ago
- Code for paper: Towards Similarity Graphs Constructed by Deep Reinforcement Learning☆19Updated 4 years ago
- This repository contains code and data download scripts for the paper "Intermediate Training of BERT for Product Matching" by Ralph Peete…☆35Updated last year
- Implementation of ip-nsw from Non-metric Similarity Graphs for Maximum Inner Product Search☆40Updated 6 years ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆45Updated 6 years ago
- Implementation of the paper Identifying Mislabeled Data using the Area Under the Margin Ranking: https://arxiv.org/pdf/2001.10528v2.pdf☆21Updated 4 years ago
- A few-shot learning method based on siamese networks.☆28Updated last year
- Source code for SIGMOD 2020 paper "Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination"☆45Updated 4 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆17Updated 3 years ago
- Scalable Hierarchical Clustering with Tree Grafting☆28Updated 2 years ago
- To reproduce experiments of the paper "Entity Matching with Transformer Architectures"☆27Updated 5 years ago
- ☆32Updated 6 years ago
- (Personalized) Page-Rank computation using PyTorch☆86Updated last year
- deep entity resolution lite version☆11Updated 5 years ago
- A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing☆19Updated 11 months ago
- Accurate and Fast ALSH for Maximum Inner Product Search (KDD 2018)☆24Updated 3 years ago
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆21Updated 2 years ago