seiflotfy / superminhashLinks
SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation
☆23Updated 7 years ago
Alternatives and similar repositories for superminhash
Users that are interested in superminhash are comparing it to the libraries listed below
Sorting:
- Bleve Extensions☆48Updated last year
- LogLog based Cardinality Estimator☆62Updated 7 years ago
- A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29☆89Updated 2 years ago
- Fast integer map for uint32-to-uint32☆28Updated 2 weeks ago
- ☆23Updated 8 years ago
- shoco is a compressor for small text strings.☆10Updated 5 years ago
- A service to manage your Cuckoo filters☆18Updated 7 years ago
- Various parsing utilities, such as IP, time, and top-level-domain, in Go☆25Updated 8 years ago
- A counter data structure that knows when to start estimating to save space☆35Updated 7 years ago
- A Go implementation of the Elias-Fano encoding☆39Updated 11 months ago
- Difference hash of images☆19Updated last year
- A fast collection type that uses uint64 for keys.☆44Updated 4 years ago
- Raft backend implementation using BuntDB☆17Updated 5 years ago
- A Go library for space-efficient rank/select operations for both sparse and dense bit arrays.☆37Updated 4 years ago
- Go driver for ragel scanners☆36Updated 5 years ago
- A golang streaming histogram sketch. Fast quantiles and counts below a threshold.☆44Updated last year
- blance - functional algorithm to assign partitions and replicas across distributed nodes☆14Updated last year
- Replication for Boltdb databases.☆29Updated 8 years ago
- Golang: contex-aware synchronization primitives (mutex).☆32Updated last year
- A pure Go implementation of the smaz compression library for short strings.☆20Updated 9 years ago
- sparse levenshtein automaton in go☆24Updated 4 years ago
- Minhash LSH in Golang☆25Updated 5 years ago
- EWAH bitmap compression library for Go.☆16Updated 3 years ago
- dmmclust is a package for clustering short texts, based on Yin and Wang (2014)☆25Updated 7 years ago
- A Go DataFrame built on Apache Arrow☆12Updated 4 years ago
- Faster than builtin sorting using primitives.☆17Updated 9 years ago
- All-in-one text tokenizer for Go. Super-fast. Lots of features.☆13Updated 9 years ago
- High Performance Porter2 Stemmer☆46Updated 4 years ago
- npyio provides read/write access to numpy data files.☆62Updated 3 months ago
- P-Square Algorithm in Go☆36Updated 3 years ago