umbrant / QuantileEstimationLinks
Streaming estimation of percentiles, especially high percentiles.
☆63Updated 13 years ago
Alternatives and similar repositories for QuantileEstimation
Users that are interested in QuantileEstimation are comparing it to the libraries listed below
Sorting:
- A collection of algorithms for mining data streams☆205Updated last year
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆661Updated 11 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 9 years ago
- Enabling queries on compressed data.☆281Updated last year
- Bitmap compression using the CONCISE algorithm☆43Updated 8 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine☆167Updated 4 years ago
- Big Data Made Easy☆185Updated 7 years ago
- Distributed, streaming anomaly detection and prediction with HTM in Apache Flink☆136Updated 8 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆116Updated 4 years ago
- ☆110Updated 8 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆473Updated 8 years ago
- FlashX is a collection of big data analytics tools that perform data analytics in the form of graphs and matrices.☆237Updated 5 years ago
- Distributed Matrix Library☆72Updated 8 years ago
- ☆92Updated 10 years ago
- An efficient updatable key-value store for Apache Spark☆254Updated 8 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆141Updated 8 years ago
- MacroBase: A Search Engine for Fast Data☆671Updated 2 years ago
- Bloofi: A java implementation of multidimensional Bloom filters☆83Updated 4 months ago
- Fair job scheduler on Kubernetes and Mesos for batch workloads and Spark☆337Updated 2 years ago
- Low latency, strong consistency, fault tolerant distributed key value store. Colocate data and compute to achieve best performance cloud …☆115Updated 10 years ago
- ☆54Updated 6 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆30Updated 7 years ago
- Probabilistic data structures for Guava.☆54Updated 5 years ago
- Sparser: Raw Filtering for Faster Analytics over Raw Data☆434Updated 7 years ago
- Secondary sort and streaming reduce for Apache Spark☆78Updated 2 years ago
- Mirror of Apache Samoa (Incubating)☆250Updated 2 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆92Updated 9 years ago
- A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows.☆54Updated 10 years ago
- Functional, Typesafe, Declarative Data Pipelines☆139Updated 7 years ago
- MADlib has moved to Apache MADlib (incubating). Please send pull requests to the Apache repository.☆507Updated 7 years ago