seiflotfy / cmts
Count-Min Tree Sketch: Approximate counting for NLP
☆10 · Updated 7 years ago
Alternatives and similar repositories for cmts:
Users who are interested in cmts compare it to the libraries listed below.
- A counter data structure that knows when to start estimating to save space ☆35 · Updated 7 years ago
- LogLog based Cardinality Estimator ☆61 · Updated 7 years ago
- A simple database optimized for returning results by custom scoring functions. ☆20 · Updated 8 years ago
- Probabilistic Multiplicity Counting ☆49 · Updated 9 years ago
- Probabilistic data structures for processing very large datasets (MinHash, HyperLogLog) ☆10 · Updated 9 years ago
- Hyper-Compact Virtual Estimators for Big Network Data Based on Register Sharing ☆33 · Updated 7 years ago
- Golang Hash Array Map Trie ☆11 · Updated 2 years ago
- Go package that implements a bit array and some utility functions ☆10 · Updated 9 years ago
- Concurrent inverse Bloom filter. ☆13 · Updated 9 years ago
- gk: streaming quantiles ☆45 · Updated 3 years ago
- Go interface to Constant Databases (CDB) ☆10 · Updated 8 years ago
- Source of paper “A critique of the CAP theorem” ☆16 · Updated 9 years ago
- hokusai -- sketching streams in real-time ☆78 · Updated 7 years ago
- S-Bitmap: Distinct Counting with a Self-Learning Bitmap ☆37 · Updated 9 years ago
- Package trace extends the features of the Go execution tracer. ☆8 · Updated 6 years ago
- Time Adaptive Sketches (Ada-Sketches) for Summarizing Data Streams ☆37 · Updated 7 years ago
- Golang client for HyperLogLog daemon (hlld) ☆21 · Updated 9 years ago
- All-in-one text tokenizer for Go. Super-fast. Lots of features. ☆13 · Updated 9 years ago
- Morally-correct string and stream interpolation for Go. ☆24 · Updated 8 years ago
- Super-efficient, in-memory key/value data store ☆21 · Updated 7 years ago
- A Trie data structure that allows for fuzzy string matching ☆11 · Updated 9 years ago
- Careful implementation of Jaro and Jaro-Winkler text difference algorithms ☆17 · Updated 8 years ago
- TMFRAME, pronounced "time frame", is a binary standard for compactly encoding time series data ☆27 · Updated 6 years ago
- Search engine postings list with support for compression ☆11 · Updated 7 years ago
- Fast identification of character sequences in text or documents (multi-lingual) ☆18 · Updated 8 years ago
- Go implementation of cuckoo filters ☆28 · Updated 3 years ago
- github.com/cznic/interval has moved to modernc.org/interval ☆11 · Updated 6 years ago