tdunning / t-digest
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
☆2,036Updated last month
Alternatives and similar repositories for t-digest:
Users that are interested in t-digest are comparing it to the libraries listed below
- t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark☆392Updated last year
- Stream summarizer and cardinality estimator.☆2,257Updated 5 years ago
- A High Dynamic Range (HDR) Histogram☆2,194Updated 9 months ago
- MacroBase: A Search Engine for Fast Data☆665Updated 2 years ago
- An extensible distributed system for reliable nearline data streaming at scale☆934Updated 10 months ago
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc…☆2,524Updated last year
- Berkeley Tree Database (BTrDB) server☆911Updated 3 years ago
- Apache Parquet Format☆1,914Updated last week
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,238Updated last week
- Beringei is a high performance, in-memory storage engine for time series data.☆3,171Updated 6 years ago
- In-memory dimensional time series database.☆3,477Updated last week
- Vectorized processing for Apache Arrow☆484Updated 3 years ago
- Probabilistic data structures for processing continuous, unbounded streams.☆1,608Updated 4 years ago
- RocksDB Replication☆669Updated 9 months ago
- M3 monorepo - Distributed TSDB, Aggregator and Query Engine, Prometheus Sidecar, Graphite Compatible, Metrics Platform☆4,812Updated this week
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆906Updated this week
- HyperLogLog with lots of sugar (Sparse, LogLog-Beta bias correction and TailCut space reduction) brought to you by Axiom☆969Updated 2 weeks ago
- HeavyDB (formerly OmniSciDB)☆2,976Updated 6 months ago
- A fast linearizability checker written in Go 🔎☆1,003Updated last month
- What are the differences between the transaction isolation levels in databases? This is a suite of test cases which differentiate isolati…☆2,549Updated 5 months ago
- Apache Drill is a distributed MPP query layer for self describing data☆1,963Updated 2 weeks ago
- ☆980Updated 3 years ago
- The Self-Driving Database Management System☆2,038Updated 5 years ago
- Time-series database☆835Updated 2 years ago
- The FastPFOR C++ library: Fast integer compression☆910Updated 2 weeks ago
- Distributed Prometheus time series database☆1,434Updated this week
- Some notes on things I find interesting and important.☆1,988Updated 2 weeks ago
- A simple integer compression library in Java☆546Updated 9 months ago
- A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others☆3,642Updated last week
- Fast scalable time series database☆1,741Updated 3 months ago