mk-fg / image-deduplication-tool
Tool to detect (and get rid of) similar images using perceptual hashing (pHash lib)
☆82Updated 8 years ago
Alternatives and similar repositories for image-deduplication-tool:
Users that are interested in image-deduplication-tool are comparing it to the libraries listed below
- Tool for managing data-deduplication within extant compressed archive files, along with a relatively performant BK tree implementation fo…☆101Updated last year
- CLI utility to find near duplicate images and remove all but the best copy.☆161Updated last week
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆46Updated 7 years ago
- Implementation of perceptual image hash calculation in Python☆131Updated last year
- Identifies similar pictures on your local computer☆77Updated 5 years ago
- Short script for removing watermarks from PDF files. Requires pdftk.☆58Updated 6 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
- unified cli for various saas image classification apis.☆40Updated 7 years ago
- Deep visual mining for your photos and videos using YOLOv2 deep convolutional neural network based object detector and traditional face …☆22Updated 6 years ago
- Fast, (mostly) lossless JPEG transformations with Python☆148Updated last year
- A Lightroom plugin suggests keywords of photo for you☆43Updated 5 years ago
- A reverse image search algorithm which performs 2D affine transformation-invariant partial image-matching in sublinear time☆290Updated 5 years ago
- Detecting near-duplicate videos by aggregating features from intermediate CNN layers☆97Updated 6 years ago
- Esper instance for TV news analysis☆40Updated 2 years ago
- Tool for downloading sets and photos from Flickr☆237Updated this week
- (MIRROR of https://gitlab.com/vgg/vise) VGG Image Search Engine (VISE) is a standalone software for visual search of large image collecti…☆47Updated 7 months ago
- Grabbing all news.☆62Updated 5 years ago
- Some implementations of algorithms for blur detection in JPEGs☆140Updated 7 years ago
- A Python Perceptual Image Hashing Module☆211Updated 2 years ago
- Perceptual Hash project for Videos (MMAI Term Project)☆27Updated 11 years ago
- Detect source resolution of upscaled images☆242Updated last year
- Rewriting web proxy and archival tool. At this point, it just tries to download all the things.☆202Updated last week
- Next generation OCR engine based on LSTMs.☆52Updated 7 years ago
- A queue-controlled browser automation tool for improving web crawl quality☆61Updated last month
- Serving content from a WARC☆61Updated 12 years ago
- Remote client for distributed automated HTTP(s) content fetching.☆77Updated this week
- Software to dewarp book picture images, and for building models of ruled surfaces☆11Updated 9 years ago
- Saves proxied HTTP traffic to a WARC file.☆27Updated 11 years ago
- Programmatically find and read labels using Machine Learning☆46Updated 6 years ago
- Interactive Image similarity and Visual Search and Retrieval application☆96Updated last year