tdunning / t-digest
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
☆2,108 Updated 9 months ago
Alternatives and similar repositories for t-digest
Users who are interested in t-digest are comparing it to the libraries listed below.
- Stream summarizer and cardinality estimator. ☆2,264 Updated 5 years ago
- A High Dynamic Range (HDR) Histogram ☆2,279 Updated last year
- A software library of stochastic streaming algorithms, a.k.a. sketches. ☆932 Updated this week
- In-memory dimensional time series database. ☆3,517 Updated this week
- Beringei is a high performance, in-memory storage engine for time series data. ☆3,166 Updated 7 years ago
- A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others ☆3,767 Updated 3 weeks ago
- ☆1,007 Updated 4 years ago
- Apache Parquet Format ☆2,100 Updated last month
- Distributed Prometheus time series database ☆1,457 Updated last week
- Berkeley Tree Database (BTrDB) server ☆910 Updated 4 years ago
- Fast scalable time series database ☆1,753 Updated this week
- A framework for distributed systems verification, with fault injection ☆7,213 Updated this week
- Distributed storage for sequential data ☆1,905 Updated 4 years ago
- Probabilistic data structures for processing continuous, unbounded streams. ☆1,627 Updated this week
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ... ☆641 Updated last year
- A library that provides an embeddable, persistent key-value store for fast storage optimized for AWS ☆818 Updated 2 months ago
- A Java package to automatically detect anomalies in large scale time-series data ☆1,187 Updated 2 years ago
- RocksDB Replication ☆675 Updated last year
- MacroBase: A Search Engine for Fast Data ☆671 Updated 2 years ago
- Time-series database ☆843 Updated 3 years ago
- Apache Drill is a distributed MPP query layer for self describing data ☆1,997 Updated 2 weeks ago
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads ☆749 Updated this week
- Secor is a service implementing Kafka log persistence ☆1,853 Updated last month
- A cluster consistency platform ☆661 Updated 2 weeks ago
- HyperLogLog with lots of sugar (Sparse, LogLog-Beta bias correction and TailCut space reduction) brought to you by Axiom ☆1,015 Updated this week
- An extensible distributed system for reliable nearline data streaming at scale ☆950 Updated last week
- Mirror of Apache Samza ☆833 Updated 6 months ago
- Jeff Dean's latency numbers plotted over time ☆2,138 Updated last year
- A fast key/value store that is efficient for high-volume random access reads and writes. ☆356 Updated 8 years ago
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc… ☆2,525 Updated last year