Marcnuth / deduplication

Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.
β˜†16Updated last year

Related projects β“˜

Alternatives and complementary repositories for deduplication