ALShum / MinHashLSH

Java implementation for MinHash and LSH for finding near duplicate documents as measured by Jaccard similarity.
31Updated 9 years ago

Related projects

Alternatives and complementary repositories for MinHashLSH