tdunning / t-digestLinks
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
☆2,131Updated 11 months ago
Alternatives and similar repositories for t-digest
Users that are interested in t-digest are comparing it to the libraries listed below
Sorting:
- Stream summarizer and cardinality estimator.☆2,266Updated 6 years ago
- A framework for distributed systems verification, with fault injection☆7,308Updated 3 weeks ago
- A High Dynamic Range (HDR) Histogram☆2,351Updated last year
- Beringei is a high performance, in-memory storage engine for time series data.☆3,170Updated 7 years ago
- A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others☆3,814Updated 3 weeks ago
- Distributed storage for sequential data☆1,906Updated 4 years ago
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆945Updated this week
- In-memory dimensional time series database.☆3,535Updated last week
- Berkeley Tree Database (BTrDB) server☆911Updated 4 years ago
- RocksDB Replication☆681Updated last year
- WiredTiger's source tree☆2,373Updated this week
- A cluster consistency platform☆659Updated last week
- Sources for my PhD dissertation on the Raft consensus algorithm☆1,062Updated 9 years ago
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc…☆2,532Updated last year
- An open source clone of Amazon's Dynamo.☆2,682Updated 2 years ago
- Distributed object store☆1,780Updated last week
- ☆1,008Updated 4 years ago
- Probabilistic data structures for processing continuous, unbounded streams.☆1,641Updated 2 months ago
- An extensible distributed system for reliable nearline data streaming at scale☆952Updated last week
- Roshi is a large-scale CRDT set implementation for timestamped events.☆3,176Updated 3 months ago
- Apache Parquet Format☆2,224Updated last week
- Fast scalable time series database☆1,751Updated 2 weeks ago
- A scalable, distributed Time Series Database.☆5,064Updated last year
- A generic dynamo implementation for different k-v storage engines☆4,225Updated last year
- Apache Avro is a data serialization system.☆3,216Updated last week
- A Java package to automatically detect anomalies in large scale time-series data☆1,189Updated 2 years ago
- Mirror of Apache Samza☆837Updated 9 months ago
- A library that provides an embeddable, persistent key-value store for fast storage optimized for AWS☆829Updated 4 months ago
- Distributed Prometheus time series database☆1,462Updated last week
- A low-latency, cloud-native KVS☆707Updated 4 years ago