erdogant / undoubleLinks
Python package undouble is to detect (near-)identical images.
☆57Updated 2 months ago
Alternatives and similar repositories for undouble
Users that are interested in undouble are comparing it to the libraries listed below
Sorting:
- A neural network based file sorter. Trains an autoencoder to sort images or audio based on the similarity of their encodings, or uses the…☆29Updated 2 years ago
- Python package to generate image embeddings with CLIP without PyTorch/TensorFlow☆155Updated 3 years ago
- A library for detecting problematic data segments in structured and unstructured data with few lines of code.☆64Updated last year
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP☆50Updated 3 years ago
- Python package for deduplication/entity resolution using active learning☆82Updated last year
- Traversing links to find the deep source of information☆69Updated 2 years ago
- The largest multilingual image-text classification dataset. It contains fashion products.☆75Updated 2 years ago
- Utilities for working with videos☆13Updated 4 months ago
- 🔤 Measure edit distance based on keyboard layout☆61Updated last month
- Sort a folder of images according to their similarity with provided text in your browser (uses a browser-ported version of OpenAI's CLIP …☆185Updated last year
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆149Updated last week
- Evalica, your favourite evaluation toolkit☆60Updated last week
- A Jupyter widget for annotating images with bounding boxes☆135Updated last year
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆74Updated this week
- OCR, Archive, Index and Search: Implementation agnostic OCR framework.☆223Updated 2 years ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆83Updated 10 months ago
- ☆14Updated 5 years ago
- RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).☆35Updated 3 years ago
- 🤗🖼️ HuggingPics: Fine-tune Vision Transformers for anything using images found on the web.☆308Updated last year
- ☆14Updated last year
- CLI utility to find near duplicate images and remove all but the best copy.☆166Updated last week
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆67Updated last month
- Demo example of consumer goods categorization☆29Updated last year
- 🖍️ Highlight text in documents☆109Updated 6 months ago
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.☆142Updated 7 months ago
- ☆19Updated last month
- 🤝 Trade any tensors over the network☆30Updated 2 years ago
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR.☆31Updated 2 years ago
- ☆45Updated 2 years ago