axiomhq / hyperminhash
HyperMinHash: Bringing intersections to HyperLogLog
☆302Updated 6 years ago
Alternatives and similar repositories for hyperminhash:
Users that are interested in hyperminhash are comparing it to the libraries listed below
- Quickly detect already witnessed data.☆157Updated 6 months ago
- Optimal Quantile Approximation in Streams☆163Updated last year
- HyperBitBit☆133Updated 7 years ago
- Go implementation of MIDAS: Microcluster-Based Detector of Anomalies in Edge Streams☆187Updated 4 years ago
- Implementations of a data structure with false negatives but no false positives.☆355Updated last year
- ☆37Updated 6 years ago
- LogLog based Cardinality Estimator☆61Updated 7 years ago
- Accelerated Sparse Linear Algebra with Postgres and SuiteSparse☆369Updated last month
- A learned index structure☆52Updated 4 years ago
- Time Adaptive Sketches (Ada-Sketches) for Summarizing Data Streams☆37Updated 7 years ago
- A General-Purpose Counting Filter: Counting Quotient Filter☆127Updated last year
- UI for interactive data analysis | https://snorkel.logv.org☆161Updated 10 months ago
- columnar storage + NoSQL OLAP engine | https://logv.org☆306Updated 5 months ago
- Fabric is a simple triplestore written in Golang☆198Updated 2 years ago
- hokusai -- sketching streams in real-time☆78Updated 7 years ago
- Go implementations of the distributed quantile sketch algorithm DDSketch☆172Updated 2 weeks ago
- BloomFilter in python☆102Updated 7 years ago
- Pilosa Dev Kit - implementation tooling and use case examples are here!☆31Updated 2 years ago
- Advanced Bloom Filter Based Algorithms for Efficient Approximate Data De-Duplication in Streams☆241Updated 7 years ago
- A general-purpose data analysis engine radically changing the way batch and stream data is processed☆7Updated 6 years ago
- Mmap radix sort file by a fixed length prefix of each line☆52Updated 4 years ago
- Hyper-Compact Virtual Estimators for Big Network Data Based on Register Sharing☆33Updated 7 years ago
- Probabilistic Multiplicity Counting☆49Updated 9 years ago
- poor man's kafka (plus in-place mutations and search)☆109Updated 2 years ago
- Distributed Named Pipes☆454Updated 7 years ago
- Golomb Coded Sets☆91Updated 7 years ago
- A key/value store for serving static batch data☆175Updated last year
- Interactive visualization framework for Runway models of distributed systems☆189Updated 2 years ago
- d-left Counting Bloom Filter☆56Updated 9 years ago
- Cantor provides utilities for estimating the cardinality of large sets.☆83Updated 2 years ago