seiflotfy / superminhashLinks
SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation
☆25Updated 8 years ago
Alternatives and similar repositories for superminhash
Users that are interested in superminhash are comparing it to the libraries listed below
Sorting:
- Bleve Extensions☆46Updated last year
- A golang streaming histogram sketch. Fast quantiles and counts below a threshold.☆45Updated 2 years ago
- A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29☆88Updated 3 years ago
- Fast integer map for uint32-to-uint32☆32Updated 6 months ago
- A fast collection type that uses uint64 for keys.☆44Updated 5 years ago
- shoco is a compressor for small text strings. [Not maintained].☆10Updated 6 years ago
- Minimal Perfect Hashing for Go☆191Updated last year
- dmmclust is a package for clustering short texts, based on Yin and Wang (2014)☆26Updated 8 years ago
- LogLog based Cardinality Estimator☆63Updated 8 years ago
- A Go implementation of the Elias-Fano encoding☆40Updated last year
- GO binary I/O☆18Updated 6 years ago
- I'm trying to learn how to use ragel in Go libraries. As I'm implementing things for practice I'll add them here. I'll be using Go 1.1, t…☆65Updated 12 years ago
- A simple tool to collect and process quite a few web news from multiple sources☆36Updated 3 years ago
- go-judy is a Go language wrapper of the Judy array implementation at http://judy.sourceforge.net.☆29Updated 12 years ago
- P-Square Algorithm in Go☆36Updated 3 years ago
- Succinct Data Structure of Trie, written in Go☆42Updated 4 years ago
- Interpolation search☆12Updated 4 years ago
- fast int64-int64 map for go☆99Updated 2 months ago
- Pure-Go full text indexer and search library☆94Updated 10 years ago
- An experimental KV store, which implements an LSM on top of Bolt segments.☆32Updated 9 years ago
- A port of Stream VByte to Go☆35Updated 3 years ago
- Go implementation of Count-Min-Log☆69Updated 10 months ago
- Go difference algorithm☆60Updated 12 years ago
- Various parsing utilities, such as IP, time, and top-level-domain, in Go☆25Updated 9 years ago
- High Performance Porter2 Stemmer☆47Updated 5 years ago
- sharded key-value store compatible with p5-ShardedKV☆38Updated 5 years ago
- Binary heap priority queues in Go☆32Updated 4 years ago
- Bloom-filter based search index☆126Updated 4 years ago
- Various packages for working with Apache Arrow in Go, including a DataFrame implementation.☆29Updated 11 months ago
- Directed Acyclic Word Graph implementation in Go, with fuzzy search of words in the graph.☆31Updated 12 years ago