erdogant / undouble
undouble is a Python package for detecting (near-)identical images.
☆51 Updated last month
Alternatives and similar repositories for undouble:
Users who are interested in undouble are comparing it to the libraries listed below:
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP ☆49 Updated 2 years ago
- Pure Python implementation of the XZ file format with random access support ☆27 Updated 2 years ago
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc. ☆18 Updated last year
- Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree. ☆158 Updated 2 years ago
- Python JSON benchmarking and "correctness". ☆31 Updated last year
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters ☆136 Updated 3 months ago
- ☆19 Updated last week
- Generate embeddings for images and text using CLIP with LLM ☆67 Updated 11 months ago
- Loadable spellfix1 extension for SQLite as a Python package ☆26 Updated 11 months ago
- ☆40 Updated 2 years ago
- 🤝 Trade any tensors over the network ☆30 Updated last year
- ☆111 Updated 3 years ago
- Blazingly fast Markdown parser for Python written in Rust. ☆34 Updated this week
- Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets. ☆37 Updated this week
- pdfrw is a pure Python library that reads and writes PDFs ☆31 Updated 2 years ago
- A tool to create dependency graphs of ideas (useful for presentation or teaching) ☆12 Updated 5 months ago
- A neural-network-based file sorter. Trains an autoencoder to sort images or audio based on the similarity of their encodings, or uses the… ☆30 Updated last year
- 🔤 Measure edit distance based on keyboard layout ☆60 Updated last year
- A modular graph-based DataSet implementation for PyTorch ☆31 Updated last week
- Tree-based indexes for neural-search ☆29 Updated last year
- A fun party trick to run Python code from another venv into this one. ☆182 Updated 2 weeks ago
- Python SDK for XetHub ☆49 Updated 5 months ago
- Creates an index of images, queries a local LLM and adds tags to the image metadata ☆138 Updated this week
- Effective frame sampling for ML applications. ☆18 Updated 3 months ago
- A Python library for extracting color palettes from supplied images. ☆133 Updated 2 weeks ago
- 🔢 Work with static vector models ☆23 Updated 2 months ago
- clustimage is a Python package for unsupervised clustering of images. ☆103 Updated 3 weeks ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals … ☆13 Updated 8 months ago
- An API for VoiceCraft. ☆25 Updated 9 months ago
- Karras et al. (2022) diffusion models for PyTorch ☆19 Updated 10 months ago