tdunning / t-digestLinks
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
☆2,072Updated 4 months ago
Alternatives and similar repositories for t-digest
Users that are interested in t-digest are comparing it to the libraries listed below
Sorting:
- t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark☆396Updated 2 years ago
- Distributed storage for sequential data☆1,903Updated 3 years ago
- Beringei is a high performance, in-memory storage engine for time series data.☆3,167Updated 6 years ago
- In-memory dimensional time series database.☆3,494Updated this week
- A High Dynamic Range (HDR) Histogram☆2,215Updated 11 months ago
- Parsing and analysis of Vertica, Hive, and Presto SQL.☆1,080Updated 3 years ago
- Stream summarizer and cardinality estimator.☆2,259Updated 5 years ago
- Berkeley Tree Database (BTrDB) server☆910Updated 3 years ago
- A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others☆3,682Updated last week
- MacroBase: A Search Engine for Fast Data☆668Updated 2 years ago
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc…☆2,523Updated last year
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆914Updated last week
- Sparser: Raw Filtering for Faster Analytics over Raw Data☆433Updated 6 years ago
- Waltz is a quorum-based distributed write-ahead log for replicating transactions☆424Updated 2 years ago
- HyperLogLog with lots of sugar (Sparse, LogLog-Beta bias correction and TailCut space reduction) brought to you by Axiom☆982Updated last month
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter☆3,624Updated 2 years ago
- A scalable, distributed Time Series Database.☆5,045Updated 6 months ago
- Distributed Stream and Batch Processing☆1,103Updated 6 months ago
- Mirror of Apache Samza☆826Updated last month
- A cluster consistency platform☆651Updated last week
- Secor is a service implementing Kafka log persistence☆1,850Updated last week
- RocksDB Replication☆668Updated last year
- Apache Parquet Format☆1,967Updated last week
- A high performance replicated log service. (The development is moved to Apache Incubator)☆2,217Updated 5 years ago
- Roshi is a large-scale CRDT set implementation for timestamped events.☆3,170Updated 2 years ago
- A framework for distributed systems verification, with fault injection☆7,085Updated last month
- Time Series and FoundationDB. Millions of writes/s and 10x compression in under 2,000 lines of Go.☆516Updated 5 years ago
- A novel implementation of the Raft consensus algorithm☆580Updated 7 years ago
- Mirror of Apache Helix☆481Updated this week
- Distributed Big Data Orchestration Service☆1,739Updated this week