fake-name / IntraArchiveDeduplicator
Tool for managing data-deduplication within extant compressed archive files, along with a relatively performant BK tree implementation for fuzzy image searching.
☆98 · Updated last year
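For readers unfamiliar with the BK tree mentioned in the description, here is a minimal sketch of the general technique applied to integer perceptual hashes under Hamming distance. It is not taken from IntraArchiveDeduplicator; the names `hamming`, `BKNode`, and `BKTree` are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Optional, Tuple


def hamming(a: int, b: int) -> int:
    """Count the bits that differ between two integer hashes."""
    return bin(a ^ b).count("1")


class BKNode:
    """One stored hash; children are keyed by their distance to this node."""

    def __init__(self, value: int) -> None:
        self.value = value
        self.children: Dict[int, "BKNode"] = {}


class BKTree:
    """Illustrative BK tree (hypothetical class, not the repository's API)."""

    def __init__(self) -> None:
        self.root: Optional[BKNode] = None

    def insert(self, value: int) -> None:
        if self.root is None:
            self.root = BKNode(value)
            return
        node = self.root
        while True:
            d = hamming(value, node.value)
            if d == 0:
                return  # identical hash is already stored
            child = node.children.get(d)
            if child is None:
                node.children[d] = BKNode(value)
                return
            node = child  # descend along the edge labelled d

    def search(self, query: int, max_dist: int) -> List[Tuple[int, int]]:
        """Return sorted (distance, hash) pairs within max_dist of query."""
        results: List[Tuple[int, int]] = []
        stack = [self.root] if self.root is not None else []
        while stack:
            node = stack.pop()
            d = hamming(query, node.value)
            if d <= max_dist:
                results.append((d, node.value))
            # Triangle inequality: only subtrees whose edge label lies in
            # [d - max_dist, d + max_dist] can contain a match.
            for edge, child in node.children.items():
                if d - max_dist <= edge <= d + max_dist:
                    stack.append(child)
        return sorted(results)
```

Because each lookup skips any subtree whose edge label falls outside that window, a query with a small `max_dist` typically visits only a fraction of the stored hashes, which is what makes the structure attractive for fuzzy image matching.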
Related projects
Alternatives and complementary repositories for IntraArchiveDeduplicator
- Detect source resolution of upscaled images ☆238 · Updated 7 months ago
- Fast hamming-distance range searches via native GiST Indexing facility in PostgreSQL ☆167 · Updated 5 years ago
- ☆87 · Updated 10 months ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy. ☆44 · Updated 6 years ago
- Implementation of perceptual image hash calculation in Python ☆130 · Updated last year
- Github-Repository of the pHash.org library for perceptual hashing. ☆222 · Updated 6 years ago
- Serving content from a WARC ☆60 · Updated 11 years ago
- bktree data structure with a Python interface for a CPP implementation ☆13 · Updated 7 years ago
- A reverse image search algorithm which performs 2D affine transformation-invariant partial image-matching in sublinear time ☆290 · Updated 5 years ago
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io ☆39 · Updated 9 years ago
- isk-daemon is an open source standalone server and library capable of adding content-based (visual) image searching to any image related … ☆137 · Updated 9 years ago
- A multi format lossless image optimizer that uses external tools ☆111 · Updated last week
- A Python binding for libpuzzle. ☆45 · Updated 4 years ago
- Hamming distance between hex strings in SQLite ☆24 · Updated 6 years ago
- Hekate, a highly-concurrent BitTorrent seeder ☆74 · Updated 11 years ago
- C++ implementation of hamming distance algorithm HmSearch using Kyoto Cabinet ☆41 · Updated 8 years ago
- Compare cost, durability, and region support of public cloud object stores, e.g., Amazon S3 ☆71 · Updated 6 years ago
- A simple headless browser ☆73 · Updated 10 months ago
- Perceptual Hash project for Videos (MMAI Term Project) ☆27 · Updated 10 years ago
- ☆26 · Updated 8 years ago
- Saves proxied HTTP traffic to a WARC file. ☆26 · Updated 11 years ago
- Programmable Dropbox for secure IoT ☆70 · Updated 7 years ago
- Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree. ☆153 · Updated last year
- A tiny HTTP server that can serve files out of any rclone remote. ☆39 · Updated 7 years ago
- Perceptual hashing tools for detecting child sexual abuse material ☆178 · Updated last month
- A Python Perceptual Image Hashing Module ☆209 · Updated 2 years ago
- Insanely fast JPEG/ JPG thumbnail scaling with the minimum fuss and CPU overhead. It makes use of libjpeg features of being able to load … ☆265 · Updated last year
- Fast Python bindings to Sophia Database ☆80 · Updated last month
- 💾 YouTube video metadata archiver written in Golang ☆19 · Updated 4 years ago