seiflotfy / superminhash
SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation
☆24Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for superminhash
- Bleve Extensions☆47Updated 8 months ago
- LogLog based Cardinality Estimator☆61Updated 7 years ago
- A fast collection type that uses uint64 for keys.☆44Updated 4 years ago
- Succinct Data Structure of Trie, written in Go☆42Updated 3 years ago
- A counter data structure that knows when to start estimating to save space☆36Updated 7 years ago
- BottomK minwise hashing for streaming set similarity☆42Updated 5 years ago
- Fast integer map for uint32-to-uint32☆28Updated last week
- Count-Min Tree Sketch: Approximate counting for NLP☆10Updated 7 years ago
- Time Adaptive Sketches (Ada-Sketches) for Summarizing Data Streams☆37Updated 7 years ago
- shoco is a compressor for small text strings.☆10Updated 5 years ago
- HyperLogLog++ for Go☆43Updated 6 years ago
- High Performance Porter2 Stemmer☆46Updated 4 years ago
- Hyper-Compact Virtual Estimators for Big Network Data Based on Register Sharing☆33Updated 7 years ago
- Raft backend implementation using BuntDB☆16Updated 4 years ago
- Go implementation of Count-Min-Log☆66Updated 7 years ago
- ☆23Updated 8 years ago
- blance - functional algorithm to assign partitions and replicas across distributed nodes☆14Updated 7 months ago
- doc2vec , word2vec, implemented by golang. word embedding representation☆41Updated 6 years ago
- liblinear bindings for Go☆45Updated 6 years ago
- A pure Go implementation of the smaz compression library for short strings.☆20Updated 8 years ago
- Interpolation search☆12Updated 3 years ago
- Integer Compression Libraries for Go☆129Updated 6 years ago
- [deprecated] use logical decoding in postgres 9.4 or later for similar functionality☆52Updated 6 years ago
- Sliding-LogLog-Beta☆36Updated 7 years ago
- My sandbox for go experiments☆27Updated 4 years ago
- sharded key-value store compatible with p5-ShardedKV☆36Updated 4 years ago
- Algo exposes the same hashing algorithms used by the Go runtime.☆14Updated 6 years ago
- Go interface to Constant Databases (CDB)☆10Updated 8 years ago
- All-in-one text tokenizer for Go. Super-fast. Lots of features.☆13Updated 8 years ago
- Counter Data structure for Golang using CountMin Sketch with a fixed amount of memory☆44Updated 6 years ago