dgryski / go-minhash
BottomK minwise hashing for streaming set similarity
☆43 Updated 6 years ago
Alternatives and similar repositories for go-minhash:
Users who are interested in go-minhash are comparing it to the libraries listed below.
- Minhash LSH in Golang ☆25 Updated 5 years ago
- sparse levenshtein automaton in go ☆24 Updated 4 years ago
- Reader and writer of HDF5 files ☆18 Updated 7 years ago
- High Performance Porter2 Stemmer ☆46 Updated 4 years ago
- bíogo data store repository ☆38 Updated 4 years ago
- A port of Stream VByte to Go ☆35 Updated 3 years ago
- S-Bitmap: Distinct Counting with a Self-Learning Bitmap ☆37 Updated 9 years ago
- libsvm go version ☆73 Updated 8 years ago
- Regular Expression Research ☆11 Updated 2 years ago
- A counter data structure that knows when to start estimating to save space ☆35 Updated 7 years ago
- Trigram search library for Go ☆70 Updated 10 years ago
- Bloom-filter based search index ☆122 Updated 3 years ago
- A Go library for space-efficient rank/select operations for both sparse and dense bit arrays. ☆37 Updated 4 years ago
- Super-efficient, in-memory key/value data store ☆21 Updated 7 years ago
- A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29 ☆89 Updated 2 years ago
- Search engine postings list with support for compression ☆11 Updated 7 years ago
- LogLog based Cardinality Estimator ☆62 Updated 7 years ago
- liblinear bindings for Go ☆45 Updated 6 years ago
- Fast identification of character sequences in text or documents (multi-lingual) ☆18 Updated 9 years ago
- Go implementation of SIMD-BP128 integer encoding and decoding ☆30 Updated 3 years ago
- Count-Min Tree Sketch: Approximate counting for NLP ☆10 Updated 7 years ago
- Ngram index for golang ☆114 Updated 8 years ago
- A pure Go implementation of the smaz compression library for short strings. ☆20 Updated 9 years ago
- dmmclust is a package for clustering short texts, based on Yin and Wang (2014) ☆25 Updated 7 years ago
- Interlink Remote Applications ☆21 Updated 6 years ago
- A []byte pool for Go. ☆41 Updated 2 years ago
- Implementation of "An Optimal Suffix Array Construction Algorithm" described in a Technical Report by Ge Nong ☆26 Updated 12 years ago
- Provide Golang native SIMD intrinsics on x86/amd64 platform ☆47 Updated 7 years ago
- Bleve Extensions ☆48 Updated last year
- Multiclass Naive Bayesian Classification ☆75 Updated 6 years ago