loretoparisi / lshash
locality sensitive hashing (LSHASH) for Python3
☆69Updated last week
Alternatives and similar repositories for lshash
Users that are interested in lshash are comparing it to the libraries listed below
Sorting:
- Clustering for arbitrary data and dissimilarity function☆94Updated 11 months ago
- string embed for fast edit distance computation, codes for [Convolutional Embedding for Edit Distance (SIGIR 20)].☆61Updated 2 years ago
- Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner P…☆59Updated 4 years ago
- Table2Vec: Neural Word and Entity Embeddings for Table Population and Retrieval☆23Updated 6 years ago
- This is a helper for PyTorch-BigGraph☆22Updated 5 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆19Updated 3 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆16Updated 3 years ago
- Extremely simple and fast extreme multi-class and multi-label classifiers.☆67Updated last month
- ☆19Updated 5 years ago
- Applying Snorkel to SuperGLUE☆24Updated 5 years ago
- ☆16Updated 4 years ago
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding☆57Updated 4 years ago
- Neural LSH [ICLR 2020] - Using supervised learning to produce better space partitions for fast nearest neighbor search.☆73Updated 4 years ago
- ☆29Updated 7 years ago
- Code for ICML2019 paper: Learning to Route in Similarity Graphs☆58Updated 9 months ago
- deep entity resolution lite version☆11Updated 5 years ago
- Locality Sensitive Hashing, fuzzy-hash, min-hash, simhash, aHash, pHash, dHash。基于 Hash值的图片相似度、文本相似度☆59Updated last year
- Break Wikidata dumps into smaller knowledge graphs☆42Updated 4 years ago
- A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing☆23Updated last year
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆22Updated 2 years ago
- hnsw implemented by python☆66Updated 5 years ago
- Topic taxonomy completion with hierarchical discovery of novel topic clusters☆24Updated 3 years ago
- Implementation of ip-nsw from Non-metric Similarity Graphs for Maximum Inner Product Search☆39Updated 6 years ago
- (Personalized) Page-Rank computation using PyTorch☆90Updated 2 years ago
- Implementation of HNSW that supports online updates☆66Updated 7 years ago
- Scalable Hierarchical Clustering with Tree Grafting☆28Updated 2 years ago
- Fast graph-regularized matrix factorization☆20Updated last year
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆47Updated 7 years ago
- RankDCG: ranking/ordering evaluation measure☆38Updated 3 years ago
- WSDM'22 Best Paper: Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval☆120Updated 9 months ago