dgryski / go-minhashLinks
BottomK minwise hashing for streaming set similarity
☆43Updated 6 years ago
Alternatives and similar repositories for go-minhash
Users that are interested in go-minhash are comparing it to the libraries listed below
Sorting:
- Minhash LSH in Golang☆27Updated 5 years ago
- Bloom-filter based search index☆124Updated 3 years ago
- package stl implements seasonal-trend decomposition by LOESS.☆48Updated 2 years ago
- Ngram index for golang☆114Updated 9 years ago
- npyio provides read/write access to numpy data files.☆63Updated 6 months ago
- Reader and writer of HDF5 files☆18Updated 8 years ago
- High Performance Porter2 Stemmer☆47Updated 4 years ago
- LogLog based Cardinality Estimator☆63Updated 7 years ago
- A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29☆90Updated 2 years ago
- S-Bitmap: Distinct Counting with a Self-Learning Bitmap☆37Updated 9 years ago
- bíogo data store repository☆38Updated 4 years ago
- Fast identification of character sequences in text or documents (multi-lingual)☆18Updated 9 years ago
- A Go library for space-efficient rank/select operations for both sparse and dense bit arrays.☆37Updated 5 years ago
- Integer Compression Libraries for Go☆132Updated 7 years ago
- Go implementation of Count-Min-Log☆67Updated 6 months ago
- libsvm go version☆73Updated 9 years ago
- Go implementation of SIMD-BP128 integer encoding and decoding☆31Updated 3 years ago
- P-Square Algorithm in Go☆36Updated 3 years ago
- Dremel DB Column Striping and Record Assembly Algorithms in Golang☆21Updated 12 years ago
- dmmclust is a package for clustering short texts, based on Yin and Wang (2014)☆26Updated 7 years ago
- A counter data structure that knows when to start estimating to save space☆34Updated 7 years ago
- Package mafsa implements Minimal Acyclic Finite State Automata in Go, essentially a high-speed, memory-efficient, Unicode-friendly set of…☆295Updated 6 years ago
- simhash storage and searching☆138Updated 8 years ago
- Interpolation search☆12Updated 3 years ago
- SSE-optimized group varint integer encoding☆38Updated 2 years ago
- Fast Longest Common Substring in Go☆47Updated 8 months ago
- gk: streaming quantiles☆45Updated 3 years ago
- Careful implementation of Jaro and Jaro-Winkler text difference algorithms☆17Updated 8 years ago
- Fast integer map for uint32-to-uint32☆31Updated 2 months ago
- Locality Sensitive Hashing for Go (Multi-probe LSH, LSH Forest, basic LSH)☆107Updated 7 years ago