erdogant / undoubleLinks
Python package undouble is to detect (near-)identical images.
☆51Updated 3 weeks ago
Alternatives and similar repositories for undouble
Users that are interested in undouble are comparing it to the libraries listed below
Sorting:
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP☆50Updated 3 years ago
- Python package to generate image embeddings with CLIP without PyTorch/TensorFlow☆152Updated 3 years ago
- Python package for deduplication/entity resolution using active learning☆81Updated last year
- Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree.☆162Updated 2 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆67Updated 2 years ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆83Updated 8 months ago
- The largest multilingual image-text classification dataset. It contains fashion products.☆74Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- A library for detecting problematic data segments in structured and unstructured data with few lines of code.☆64Updated last year
- A Jupyter widget for annotating images with bounding boxes☆136Updated last year
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.☆142Updated 5 months ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆66Updated this week
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR.☆31Updated 2 years ago
- ☆30Updated 3 years ago
- ☆112Updated 4 years ago
- 🤗🖼️ HuggingPics: Fine-tune Vision Transformers for anything using images found on the web.☆305Updated last year
- My NER Experiments with ModernBERT and Ettin☆22Updated 2 months ago
- 🖍️ Highlight text in documents☆109Updated 5 months ago
- 🤝 Trade any tensors over the network☆30Updated last year
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- Efficiently read embedding in streaming from any filesystem☆102Updated last month
- Evalica, your favourite evaluation toolkit☆57Updated 2 weeks ago
- Sort a folder of images according to their similarity with provided text in your browser (uses a browser-ported version of OpenAI's CLIP …☆187Updated last year
- 🔤 Measure edit distance based on keyboard layout☆61Updated last year
- Source code and data for Like a Good Nearest Neighbor☆30Updated 8 months ago
- Loan Risk Prediction Neural Network and API☆17Updated 4 years ago
- A file utility for accessing both local and remote files through a unified interface.☆44Updated 4 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- Datamodels for hugging face tokenizers☆71Updated this week
- Traversing links to find the deep source of information☆69Updated 2 years ago