guofei9987 / pyLSHash
Locality Sensitive Hashing, fuzzy-hash, min-hash, simhash, aHash, pHash, dHash. Image similarity and text similarity based on hash values.
☆59 Updated last year
Alternatives and similar repositories for pyLSHash
Users who are interested in pyLSHash are comparing it to the libraries listed below
- Accurate and Fast ALSH for Maximum Inner Product Search (KDD 2018) ☆26 Updated 4 years ago
- Clustering for arbitrary data and dissimilarity function ☆96 Updated last year
- locality sensitive hashing (LSHASH) for Python3 ☆70 Updated 3 months ago
- hnsw implemented by python ☆69 Updated 6 years ago
- A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing ☆23 Updated last year
- Neural LSH [ICLR 2020] - Using supervised learning to produce better space partitions for fast nearest neighbor search. ☆73 Updated 4 years ago
- Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner P… ☆59 Updated 4 years ago
- This website is to host a series of tutorials on Deep Learning on Graphs for Natural Language Processing. ☆13 Updated 2 years ago
- Foundation Models for Data Tasks ☆108 Updated 2 years ago
- A framework for index based similarity search. ☆19 Updated 6 years ago
- Extremely simple and fast extreme multi-class and multi-label classifiers. ☆70 Updated 5 months ago
- Python wrapper for LibRec and other recommendation frameworks. ☆29 Updated last year
- YATO: Yet Another deep learning based Text analysis Open toolkit ☆46 Updated last year
- super fast cpp implementation of longest common subsequence/substring ☆71 Updated last year
- Repository for Multimodal AutoML Benchmark ☆65 Updated 3 years ago
- All the code for a series of Medium articles on Approximate Nearest Neighbors ☆45 Updated 2 years ago
- The Python implementation of SIGKDD 2017's PAMAE algorithm (parallel k-medoids algorithm) ☆33 Updated 5 years ago
- Pure python implementation of product quantization for nearest neighbor search ☆350 Updated 2 months ago
- 1) SimRank (single pair query, parallel all pair computation / dynamic updates) - by Yue Wang (https://github.com/KeithYue) and Yulin Che… ☆14 Updated 5 years ago
- Official code for "Binary embedding based retrieval at Tencent" ☆43 Updated last year
- Crawl information of papers from ACM/IEEE/arXiv/AAAI digital library. ☆39 Updated 4 years ago
- An open-source NLP library: fast text cleaning and preprocessing ☆23 Updated 3 years ago
- Code for ICML2019 paper: Learning to Route in Similarity Graphs ☆58 Updated last year
- DECAF: Deep Extreme Classification with Label Features ☆54 Updated 3 years ago
- Joint Optimization of Cascade Ranking Models (WSDM 19) ☆13 Updated 3 years ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces ☆12 Updated 2 years ago
- [ICML-25] AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML ☆30 Updated last month
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024 ☆64 Updated 11 months ago
- Recommendation algorithms for large graphs ☆28 Updated 7 months ago
- Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents ☆291 Updated 2 years ago