guofei9987 / pyLSHash
Locality Sensitive Hashing, fuzzy-hash, min-hash, simhash, aHash, pHash, dHash. Image similarity and text similarity based on hash values.
☆63Updated 2 years ago
Alternatives and similar repositories for pyLSHash
Users who are interested in pyLSHash are comparing it to the libraries listed below
- Accurate and Fast ALSH for Maximum Inner Product Search (KDD 2018)☆25Updated 4 years ago
- locality sensitive hashing (LSHASH) for Python3☆73Updated 8 months ago
- Foundation Models for Data Tasks☆110Updated 2 years ago
- Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner P…☆60Updated 5 years ago
- Clustering for arbitrary data and dissimilarity function☆99Updated last year
- Neural LSH [ICLR 2020] - Using supervised learning to produce better space partitions for fast nearest neighbor search.☆73Updated 5 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 4 years ago
- The Python implementation of SIGKDD 2017's PAMAE algorithm (parallel k-medoids algorithm)☆33Updated 6 years ago
- Python wrapper for LibRec and other recommendation frameworks.☆29Updated 2 years ago
- A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing☆23Updated 2 years ago
- HNSW implemented in Python☆72Updated 6 years ago
- A truth inference tool in crowdsourcing☆13Updated 5 years ago
- An approach to perform RAG while taking into account the dynamic evolution of the data. Helpful to detect emerging trends in the data☆30Updated 2 years ago
- Python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data☆19Updated last year
- Code for ICML2019 paper: Learning to Route in Similarity Graphs☆61Updated last year
- Some microbenchmarks and design docs before commencement☆12Updated 5 years ago
- Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.☆35Updated 3 weeks ago
- Official code for "Binary embedding based retrieval at Tencent"☆44Updated last year
- This website is to host a series of tutorials on Deep Learning on Graphs for Natural Language Processing.☆13Updated 3 years ago
- A study of the downstream instability of word embeddings☆12Updated 3 years ago
- Hyperparameter tuning via uncertainty modeling☆49Updated last year
- Extremely simple and fast extreme multi-class and multi-label classifiers.☆70Updated 2 months ago
- Fast and explainable clustering in Python☆124Updated last week
- Collection of datasets for benchmarking filtered vector similarity retrieval☆59Updated 8 months ago
- Code for the paper "Accessing higher dimensions for unsupervised word translation"☆22Updated 2 years ago
- Code and Supplementary Material to the Paper: Pairwise Learning to Rank by Neural Networks Revisited: Reconstruction, Theoretical Analysi…☆37Updated 2 years ago
- Evaluation framework for document processing models and services.☆63Updated last week
- [ICML-25] AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML☆88Updated 6 months ago
- To Index or Not to Index: Optimizing Exact Maximum Inner Product Search☆27Updated 6 years ago
- YATO: Yet Another deep learning based Text analysis Open toolkit☆47Updated 2 years ago