erdogant / undoubleLinks
Python package undouble is to detect (near-)identical images.
☆57Updated 3 months ago
Alternatives and similar repositories for undouble
Users that are interested in undouble are comparing it to the libraries listed below
Sorting:
- Python package to generate image embeddings with CLIP without PyTorch/TensorFlow☆158Updated 3 years ago
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP☆49Updated 3 years ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆85Updated last year
- The largest multilingual image-text classification dataset. It contains fashion products.☆75Updated 2 years ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆68Updated 3 months ago
- Python package for deduplication/entity resolution using active learning☆83Updated last year
- 🖍️ Highlight text in documents☆110Updated 8 months ago
- OCR, Archive, Index and Search: Implementation agnostic OCR framework.☆223Updated 2 years ago
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR.☆31Updated 2 years ago
- clustimage is a python package for unsupervised clustering of images.☆109Updated 2 weeks ago
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- A Jupyter widget for annotating images with bounding boxes☆142Updated last year
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.☆144Updated 8 months ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆154Updated last week
- semantically distinct key phrase extraction using hilbert hashes.☆50Updated 3 years ago
- Near-duplicate image detection using Locality Sensitive Hashing☆75Updated 4 years ago
- 🔤 Measure edit distance based on keyboard layout☆63Updated 2 months ago
- Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree.☆164Updated 3 years ago
- ☆112Updated 4 years ago
- CLI utility to find near duplicate images and remove all but the best copy.☆169Updated this week
- Using efficientnet to provide embeddings for retrieval☆160Updated 2 years ago
- A quick and simple tool for labeling images, videos and time series data, right from Jupyter!☆44Updated 3 months ago
- 🤗🖼️ HuggingPics: Fine-tune Vision Transformers for anything using images found on the web.☆310Updated last year
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆229Updated 2 years ago
- Generate embeddings for images and text using CLIP with LLM☆75Updated last year
- OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR☆164Updated last week
- A library for detecting problematic data segments in structured and unstructured data with few lines of code.☆64Updated last year
- Accelerated NLP pipelines for fast inference on CPU and GPU. Built with Transformers, Optimum and ONNX Runtime.☆126Updated 3 years ago
- A zero-shot captcha solver.☆16Updated 2 years ago
- ☆15Updated 2 years ago