tdunning / t-digestLinks
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
☆2,078Updated 5 months ago
Alternatives and similar repositories for t-digest
Users that are interested in t-digest are comparing it to the libraries listed below
Sorting:
- Stream summarizer and cardinality estimator.☆2,260Updated 5 years ago
- A High Dynamic Range (HDR) Histogram☆2,218Updated last year
- What are the differences between the transaction isolation levels in databases? This is a suite of test cases which differentiate isolati…☆2,604Updated 9 months ago
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆915Updated this week
- A framework for distributed systems verification, with fault injection☆7,107Updated 2 weeks ago
- Berkeley Tree Database (BTrDB) server☆910Updated 3 years ago
- RocksDB Replication☆668Updated last year
- Time-series database☆837Updated 2 years ago
- A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others☆3,696Updated last month
- Apache Parquet Format☆1,982Updated 2 weeks ago
- Distributed Prometheus time series database☆1,446Updated last week
- In-memory dimensional time series database.☆3,497Updated 2 weeks ago
- Probabilistic data structures for processing continuous, unbounded streams.☆1,616Updated 2 weeks ago
- A Java package to automatically detect anomalies in large scale time-series data☆1,184Updated last year
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc…☆2,522Updated last year
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆733Updated this week
- A library that provides an embeddable, persistent key-value store for fast storage optimized for AWS☆801Updated this week
- A cluster consistency platform☆652Updated last week
- Beringei is a high performance, in-memory storage engine for time series data.☆3,167Updated 7 years ago
- MacroBase: A Search Engine for Fast Data☆668Updated 2 years ago
- Roshi is a large-scale CRDT set implementation for timestamped events.☆3,170Updated 2 years ago
- Distributed storage for sequential data☆1,904Updated 3 years ago
- Fast scalable time series database☆1,747Updated 2 months ago
- A low-latency, cloud-native KVS☆705Updated 4 years ago
- The Self-Driving Database Management System☆2,046Updated 6 years ago
- Curated list of resources on testing distributed systems☆2,561Updated 2 months ago
- Mirror of Apache Samza☆829Updated 2 months ago
- An extensible distributed system for reliable nearline data streaming at scale☆941Updated last year
- HyperLogLog with lots of sugar (Sparse, LogLog-Beta bias correction and TailCut space reduction) brought to you by Axiom☆985Updated last month
- A D3.js plugin that produces flame graphs from hierarchical data.☆925Updated last year