tdunning / t-digest
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
☆2,065 · Updated 3 months ago
Alternatives and similar repositories for t-digest
Users who are interested in t-digest are comparing it to the libraries listed below.
- Stream summarizer and cardinality estimator. ☆2,257 · Updated 5 years ago
- A framework for distributed systems verification, with fault injection ☆7,051 · Updated 3 weeks ago
- A High Dynamic Range (HDR) Histogram ☆2,213 · Updated 11 months ago
- In-memory dimensional time series database. ☆3,489 · Updated last week
- An extensible distributed system for reliable nearline data streaming at scale ☆940 · Updated last year
- A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others ☆3,669 · Updated last month
- A software library of stochastic streaming algorithms, a.k.a. sketches. ☆910 · Updated last week
- Distributed Prometheus time series database ☆1,441 · Updated last week
- A GPU-powered real-time analytics storage and query engine. ☆3,052 · Updated 10 months ago
- A cluster consistency platform ☆650 · Updated this week
- Distributed storage for sequential data ☆1,904 · Updated 3 years ago
- Improvement of Apache Kafka Mirrormaker ☆929 · Updated last year
- MacroBase: A Search Engine for Fast Data ☆667 · Updated 2 years ago
- RocksDB Replication ☆669 · Updated 11 months ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga… ☆2,237 · Updated last week
- Distributed object store ☆1,754 · Updated this week
- Waltz is a quorum-based distributed write-ahead log for replicating transactions ☆424 · Updated 2 years ago
- HyperLogLog with lots of sugar (Sparse, LogLog-Beta bias correction and TailCut space reduction) brought to you by Axiom ☆979 · Updated 2 weeks ago
- A modular implementation of timely dataflow in Rust ☆3,444 · Updated 3 weeks ago
- Apache Parquet Format ☆1,953 · Updated 2 weeks ago
- Some notes on things I find interesting and important. ☆2,000 · Updated this week
- Beringei is a high performance, in-memory storage engine for time series data. ☆3,170 · Updated 6 years ago
- Parsing and analysis of Vertica, Hive, and Presto SQL. ☆1,080 · Updated 3 years ago
- Probabilistic data structures for processing continuous, unbounded streams. ☆1,614 · Updated last week
- What are the differences between the transaction isolation levels in databases? This is a suite of test cases which differentiate isolati… ☆2,586 · Updated 7 months ago
- High-performance time-series aggregation for PostgreSQL ☆2,647 · Updated 3 years ago
- An implementation of differential dataflow using timely dataflow on Rust. ☆2,703 · Updated 2 weeks ago
- Time-series database ☆836 · Updated 2 years ago
- A generic dynamo implementation for different k-v storage engines ☆4,203 · Updated last year
- Sparser: Raw Filtering for Faster Analytics over Raw Data ☆433 · Updated 6 years ago