dgryski / go-minhashLinks
BottomK minwise hashing for streaming set similarity
☆43Updated 6 years ago
Alternatives and similar repositories for go-minhash
Users that are interested in go-minhash are comparing it to the libraries listed below
Sorting:
- Minhash LSH in Golang☆25Updated 5 years ago
- Reader and writer of HDF5 files☆18Updated 7 years ago
- A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29☆89Updated 2 years ago
- Bloom-filter based search index☆123Updated 3 years ago
- S-Bitmap: Distinct Counting with a Self-Learning Bitmap☆37Updated 9 years ago
- A port of Stream VByte to Go☆35Updated 3 years ago
- A counter data structure that knows when to start estimating to save space☆35Updated 7 years ago
- bíogo data store repository☆38Updated 4 years ago
- Ngram index for golang☆114Updated 9 years ago
- LogLog based Cardinality Estimator☆62Updated 7 years ago
- go-judy is a Go language wrapper of the Judy array implementation at http://judy.sourceforge.net.☆29Updated 11 years ago
- Trigram search library for Go☆69Updated 10 years ago
- High Performance Porter2 Stemmer☆46Updated 4 years ago
- Go implementation of SIMD-BP128 integer encoding and decoding☆30Updated 3 years ago
- liblinear bindings for Go☆45Updated 6 years ago
- Counter Data structure for Golang using CountMin Sketch with a fixed amount of memory☆45Updated 7 years ago
- gk: streaming quantiles☆45Updated 3 years ago
- Go implementation of Count-Min-Log☆67Updated 3 months ago
- A Go library for space-efficient rank/select operations for both sparse and dense bit arrays.☆37Updated 4 years ago
- Streaming TopK estimates☆87Updated 4 years ago
- Search engine postings list with support for compresison☆11Updated 8 years ago
- P-Square Algorithm in Go☆36Updated 3 years ago
- My sandbox for go experiments☆27Updated 5 years ago
- SSE-optimized group varint integer encoding☆37Updated last year
- Efficient thread-safe circular byte buffer to keep in-memory logs☆21Updated 4 years ago
- Probabilistic Multiplicity Counting☆49Updated 9 years ago
- PopCount implementation for Go. Using hardware POPCNT instruction if available it.☆23Updated 8 years ago
- Provide Golang native SIMD intrinsics on x86/amd64 platform☆47Updated 8 years ago
- dmmclust is a package for clustering short texts, based on Yin and Wang (2014)☆25Updated 7 years ago
- Collection of statistical routines in golang☆31Updated 7 years ago