mk-fg / image-deduplication-tool
Tool to detect (and get rid of) similar images using perceptual hashing (pHash lib)
☆82Updated 8 years ago
Alternatives and similar repositories for image-deduplication-tool
Users who are interested in image-deduplication-tool are comparing it to the libraries listed below
- Tool for managing data-deduplication within extant compressed archive files, along with a relatively performant BK tree implementation fo…☆102Updated last year
- A Python Perceptual Image Hashing Module☆212Updated 2 years ago
- Grabbing all news.☆62Updated 5 years ago
- Implementation of perceptual image hash calculation in Python☆131Updated last year
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
- Detecting near-duplicate videos by aggregating features from intermediate CNN layers☆98Updated 7 years ago
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io☆38Updated 9 years ago
- Serving content from a WARC☆61Updated 12 years ago
- Video fingerprinting tool. Finding duplicate movies in a large dataset.☆43Updated 12 years ago
- (MIRROR of https://gitlab.com/vgg/vise) VGG Image Search Engine (VISE) is a standalone software for visual search of large image collecti…☆47Updated 8 months ago
- Tutorial on detecting video shot changes using Python and OpenCV. Part 1 covers basic threshold detection, Part 2 covers optimized thres…☆50Updated 4 years ago
- A reverse image search algorithm which performs 2D affine transformation-invariant partial image-matching in sublinear time☆290Updated 5 years ago
- Analyzes a video stream and returns an image that represents the movie's 'fingerprint'.☆17Updated 7 years ago
- (Note: This repository is obsolete, please see the new Browsertrix webrecorder/browsertrix) Browser-Based On-Demand Web Archiving Automat…☆39Updated 6 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆46Updated 7 years ago
- Rewriting web proxy and archival tool. At this point, it just tries to download all the things.☆201Updated 3 weeks ago
- Esper instance for TV news analysis☆40Updated 2 years ago
- recursively deduplicate a directory and write its contents to a new directory while remembering the old paths☆49Updated 4 years ago
- Tool for downloading sets and photos from Flickr☆239Updated this week
- A queue-controlled browser automation tool for improving web crawl quality☆61Updated 2 months ago
- A multi format lossless image optimizer that uses external tools☆112Updated 3 weeks ago
- Suite of tools for detecting changes in web pages and their rendering☆54Updated last year
- Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree.☆158Updated 2 years ago
- A Python 3 wrapper around the ffprobe command for extracting meta data from media files.☆22Updated 4 years ago
- Perceptual Hash project for Videos (MMAI Term Project)☆27Updated 11 years ago
- unified cli for various saas image classification apis.☆40Updated 7 years ago
- 🧠 AI powered image tagger backed by DeepDetect☆246Updated 6 years ago
- Automated shot detection software☆200Updated last year
- Turn video files into 'barcodes' where vertical lines represent the average colour of individual frames.☆84Updated 9 years ago
- A comparison of ffmpeg, Shotdetect and PySceneDetect for shot transition detection☆121Updated 7 years ago