Marcnuth / deduplication

Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.
18Updated last year

Alternatives and similar repositories for deduplication:

Users that are interested in deduplication are comparing it to the libraries listed below