tdunning / t-digest
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
☆2,096 Updated 7 months ago
Alternatives and similar repositories for t-digest
Users that are interested in t-digest are comparing it to the libraries listed below
- Stream summarizer and cardinality estimator. ☆2,267 Updated 5 years ago
- A High Dynamic Range (HDR) Histogram ☆2,230 Updated last year
- A framework for distributed systems verification, with fault injection ☆7,165 Updated last month
- In-memory dimensional time series database. ☆3,507 Updated this week
- Apache Parquet Format ☆2,046 Updated this week
- A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others ☆3,744 Updated 3 weeks ago
- A software library of stochastic streaming algorithms, a.k.a. sketches. ☆926 Updated this week
- t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed environments like PySpark ☆401 Updated 2 years ago
- An extensible distributed system for reliable nearline data streaming at scale ☆944 Updated 2 months ago
- A Java package to automatically detect anomalies in large scale time-series data ☆1,186 Updated last year
- MacroBase: A Search Engine for Fast Data ☆669 Updated 2 years ago
- Distributed storage for sequential data ☆1,901 Updated 3 years ago
- ☆1,006 Updated 3 years ago
- RocksDB Replication ☆673 Updated last year
- A cluster consistency platform ☆658 Updated this week
- Beringei is a high performance, in-memory storage engine for time series data. ☆3,165 Updated 7 years ago
- Probabilistic data structures for processing continuous, unbounded streams. ☆1,622 Updated 3 months ago
- Secor is a service implementing Kafka log persistence ☆1,853 Updated 3 weeks ago
- Roshi is a large-scale CRDT set implementation for timestamped events. ☆3,173 Updated 2 years ago
- The Heroic Time Series Database ☆847 Updated 4 years ago
- Time-series database ☆837 Updated 3 years ago
- Improvement of Apache Kafka Mirrormaker ☆933 Updated last year
- Berkeley Tree Database (BTrDB) server ☆909 Updated 4 years ago
- An open source clone of Amazon's Dynamo. ☆2,673 Updated 2 years ago
- A library that provides an embeddable, persistent key-value store for fast storage optimized for AWS ☆810 Updated last week
- M3 monorepo - Distributed TSDB, Aggregator and Query Engine, Prometheus Sidecar, Graphite Compatible, Metrics Platform ☆4,853 Updated this week
- Distributed Prometheus time series database ☆1,455 Updated this week
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc… ☆2,522 Updated last year
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga… ☆2,246 Updated this week
- Fast scalable time series database ☆1,754 Updated 2 weeks ago