erdogant / undouble
Python package undouble is to detect (near-)identical images.
☆50Updated 3 weeks ago
Alternatives and similar repositories for undouble
Users that are interested in undouble are comparing it to the libraries listed below
Sorting:
- A quick and simple tool for labeling images, videos and time series data, right from Jupyter!☆40Updated 6 months ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals …☆13Updated 9 months ago
- Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.☆40Updated this week
- A neural network based file sorter. Trains an autoencoder to sort images or audio based on the similarity of their encodings, or uses the…☆30Updated last year
- Utilities for working with videos☆13Updated 3 years ago
- Machine Learning-assisted correction of OCR errors in historical corpora☆9Updated 6 months ago
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP☆50Updated 2 years ago
- Semantic Search demo featuring UForm, USearch, UCall, and StreamLit, to visual and retrieve from image datasets, similar to "CLIP Retriev…☆45Updated last year
- Python JSON benchmarking and "correctness".☆33Updated last year
- Extract knowledge from raw text☆13Updated 3 years ago
- A library for detecting problematic data segments in structured and unstructured data with few lines of code.☆64Updated last year
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR.☆31Updated last year
- Python package for deduplication/entity resolution using active learning☆79Updated 8 months ago
- ☆20Updated 10 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆54Updated last month
- Benchmarking vision language vision on face tasks☆13Updated last month
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆69Updated last month
- Algorithms for similar image search/reverse image search☆36Updated 2 years ago
- OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR☆112Updated this week
- Pyterator helps you write fluent interfaces for collections☆9Updated 2 years ago
- A Python implementation of Lunr.js 🌖☆195Updated 2 months ago
- Python package for extractive NLP using the OpenAI API☆17Updated 8 months ago
- 🤝 Trade any tensors over the network☆30Updated last year
- ☆62Updated 4 months ago
- Home to jupyter notebooks for Mindee OSS projects☆17Updated 7 months ago
- The largest multilingual image-text classification dataset. It contains fashion products.☆72Updated last year
- A system for reading scanned documents and grouping them into high level topics☆16Updated 4 years ago
- Visionner turn raw image data into numpy array, more suitable for deep learning task☆10Updated last year
- convert pixels to SVG square-based shapes☆46Updated 2 years ago
- MinHash implementation in Python☆11Updated 8 months ago