pzcddm / FastSketchLSHLinks

This is the repo to implement the fastsketch or even better minhash-based jaccard estimator with Locality sensitive hashing to deduplicate the extreme large text corpus.
41Updated 3 weeks ago

Alternatives and similar repositories for FastSketchLSH

Users that are interested in FastSketchLSH are comparing it to the libraries listed below

Sorting: