tdunning / t-digest
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
☆2,118 Updated 10 months ago
Alternatives and similar repositories for t-digest
Users that are interested in t-digest are comparing it to the libraries listed below
- Stream summarizer and cardinality estimator. ☆2,263 Updated 6 years ago
- In-memory dimensional time series database. ☆3,525 Updated this week
- A High Dynamic Range (HDR) Histogram ☆2,331 Updated last year
- A software library of stochastic streaming algorithms, a.k.a. sketches. ☆940 Updated this week
- Berkeley Tree Database (BTrDB) server ☆910 Updated 4 years ago
- A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others ☆3,798 Updated 3 weeks ago
- A Java package to automatically detect anomalies in large scale time-series data ☆1,189 Updated 2 years ago
- ☆1,008 Updated 4 years ago
- Beringei is a high performance, in-memory storage engine for time series data. ☆3,167 Updated 7 years ago
- Apache Parquet Format ☆2,166 Updated last week
- RocksDB Replication ☆679 Updated last year
- A library that provides an embeddable, persistent key-value store for fast storage optimized for AWS ☆827 Updated 3 months ago
- Distributed storage for sequential data ☆1,905 Updated 4 years ago
- An open source clone of Amazon's Dynamo. ☆2,682 Updated 2 years ago
- A cluster consistency platform ☆660 Updated last week
- What are the differences between the transaction isolation levels in databases? This is a suite of test cases which differentiate isolati… ☆2,651 Updated last year
- Mirror of Apache Samza ☆834 Updated 8 months ago
- WiredTiger's source tree ☆2,362 Updated this week
- Mirror of Apache Cassandra (incubating) ☆438 Updated 2 years ago
- MacroBase: A Search Engine for Fast Data ☆671 Updated 3 years ago
- Secor is a service implementing Kafka log persistence ☆1,853 Updated this week
- Distributed Prometheus time series database ☆1,461 Updated 2 weeks ago
- An extensible distributed system for reliable nearline data streaming at scale ☆950 Updated last month
- Simple constant key/value storage library, for read-heavy systems with infrequent large bulk inserts. ☆1,203 Updated 2 years ago
- Probabilistic data structures for processing continuous, unbounded streams. ☆1,629 Updated last month
- Distributed object store ☆1,781 Updated 2 weeks ago
- Hollow is a java library and toolset for disseminating in-memory datasets from a single producer to many consumers for high performance r… ☆1,327 Updated last week
- A Kubernetes toolkit for building distributed applications using cloud native principles ☆2,363 Updated last year
- A fast linearizability checker written in Go 🔎 ☆1,126 Updated 2 weeks ago
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads ☆756 Updated this week