mendesk / image-ndd-lshLinks
Near-duplicate image detection using Locality Sensitive Hashing
☆75Updated 4 years ago
Alternatives and similar repositories for image-ndd-lsh
Users that are interested in image-ndd-lsh are comparing it to the libraries listed below
Sorting:
- Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree.☆164Updated 3 years ago
- locality sensitive hashing (LSHASH) for Python3☆73Updated 7 months ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆76Updated 2 months ago
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP☆49Updated 3 years ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- Clustering for arbitrary data and dissimilarity function☆99Updated last year
- Implementation of IncrementalDBSCAN clustering.☆81Updated 4 months ago
- Visual Similarity research at Fynd. Contains code to reproduce 2 of our research papers.☆83Updated 2 years ago
- semantically distinct key phrase extraction using hilbert hashes.☆50Updated 3 years ago
- ☆43Updated 2 years ago
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR.☆31Updated 2 years ago
- A Streamlit component for annotating text by text selecting.☆42Updated last year
- Source code and data for Like a Good Nearest Neighbor☆30Updated 11 months ago
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.☆144Updated 8 months ago
- Simply, faster, sentence-transformers☆143Updated last year
- Python package to generate image embeddings with CLIP without PyTorch/TensorFlow☆158Updated 3 years ago
- Bi-encoder entity linking architecture☆52Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆120Updated 2 months ago
- Python package for deduplication/entity resolution using active learning☆83Updated last year
- The largest multilingual image-text classification dataset. It contains fashion products.☆75Updated 2 years ago
- Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista…☆25Updated 6 months ago
- python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data☆19Updated last year
- Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.☆318Updated last year
- Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents☆292Updated 2 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆77Updated 3 weeks ago
- Sentence transformers models for SpaCy☆109Updated 2 years ago
- This repository contains code and data download scripts for the paper "Intermediate Training of BERT for Product Matching" by Ralph Peete…☆38Updated 3 years ago
- How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.☆135Updated 3 years ago
- Framework to build your own reverse image search engine☆82Updated 5 years ago
- Creating class-based TF-IDF matrices☆91Updated 3 years ago