ekzhu / go-datasketch
Probabilistic data structures for processing very large datasets (MinHash, HyperLogLog)
☆11 Updated 9 years ago
Alternatives and similar repositories for go-datasketch:
Users who are interested in go-datasketch are comparing it to the libraries listed below
- Go implementation of cuckoo filters ☆28 Updated 3 years ago
- Automatic codegen for encoding/binary marshaling ☆17 Updated 10 years ago
- Count-Min Tree Sketch: Approximate counting for NLP ☆10 Updated 7 years ago
- Go interface to Constant Databases (CDB) ☆10 Updated 9 years ago
- S-Bitmap: Distinct Counting with a Self-Learning Bitmap ☆37 Updated 9 years ago
- LogLog based Cardinality Estimator ☆62 Updated 7 years ago
- github.com/cznic/interval has moved to modernc.org/interval ☆11 Updated 6 years ago
- Efficient thread-safe circular byte buffer to keep in-memory logs ☆21 Updated 4 years ago
- Fast identification of character sequences in text or documents (multi-lingual) ☆18 Updated 9 years ago
- Concurrent inverse Bloom filter. ☆13 Updated 10 years ago
- A file backed circular buffer for Go. Mmap file into memory. Madvise to lower memory footprint. ☆81 Updated 6 months ago
- Golang client for HyperLogLog daemon (hlld) ☆21 Updated 9 years ago
- HyperLogLog++ for Go ☆43 Updated 7 years ago
- bufferManager demonstrates a technique to decrease pressure on GC in Go. ☆38 Updated 10 years ago
- A Trie data structure that allows for fuzzy string matching ☆11 Updated 9 years ago
- ☆15 Updated 7 years ago
- A pure Go implementation of the smaz compression library for short strings. ☆20 Updated 9 years ago
- A counter data structure that knows when to start estimating to save space ☆35 Updated 7 years ago
- Dremel DB Column Striping and Record Assembly Algorithms in Golang ☆21 Updated 11 years ago
- hokusai -- sketching streams in real-time ☆78 Updated 7 years ago
- pcg random number generator ☆41 Updated 3 years ago
- blance - functional algorithm to assign partitions and replicas across distributed nodes ☆14 Updated last year
- adaptive replacement cache ☆34 Updated 10 years ago
- Super-efficient, in-memory key/value data store ☆21 Updated 7 years ago
- All-in-one text tokenizer for Go. Super-fast. Lots of features. ☆13 Updated 9 years ago
- Various parsing utilities, such as IP, time, and top-level-domain, in Go ☆25 Updated 8 years ago
- Careful implementation of Jaro and Jaro-Winkler text difference algorithms ☆17 Updated 8 years ago
- gk: streaming quantiles ☆45 Updated 3 years ago
- TMFRAME, pronounced "time frame", is a binary standard for compactly encoding time series data ☆27 Updated 6 years ago
- Morally-correct string and stream interpolation for Go. ☆24 Updated 9 years ago