umbrant / QuantileEstimationLinks
Streaming estimation of percentiles, especially high percentiles.
☆63Updated 13 years ago
Alternatives and similar repositories for QuantileEstimation
Users that are interested in QuantileEstimation are comparing it to the libraries listed below
Sorting:
- A collection of algorithms for mining data streams☆205Updated 2 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 9 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine☆166Updated 4 years ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆659Updated 11 years ago
- Enabling queries on compressed data.☆281Updated 2 years ago
- ☆92Updated 10 years ago
- Bitmap compression using the CONCISE algorithm☆43Updated 8 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆116Updated 4 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.☆428Updated 9 years ago
- ☆53Updated 6 years ago
- FlashX is a collection of big data analytics tools that perform data analytics in the form of graphs and matrices.☆237Updated 5 years ago
- Big Data Made Easy☆185Updated 8 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆30Updated 7 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner)☆88Updated 9 years ago
- MacroBase: A Search Engine for Fast Data☆671Updated 3 years ago
- ☆110Updated 8 years ago
- Real²time Exploratory Analytics on Large Datasets☆121Updated 5 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆141Updated 8 years ago
- Mirror of Apache Samoa (Incubating)☆250Updated 2 years ago
- Probabilistic data structures for Guava.☆54Updated 5 years ago
- Persistent Adaptive Radix Trees in Java☆82Updated 5 years ago
- MADlib has moved to Apache MADlib (incubating). Please send pull requests to the Apache repository.☆508Updated 7 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆473Updated 8 years ago
- Simulating the performance of various streaming algorithms. #experimentalmathematics☆58Updated 8 years ago
- A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows.☆54Updated 10 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆16Updated 5 years ago
- An implementation of locality sensitive hashing with Hadoop☆57Updated 10 years ago
- Java implementation of SAX, HOT-SAX, and EMMA☆83Updated 2 years ago
- Distributed, streaming anomaly detection and prediction with HTM in Apache Flink☆136Updated 8 years ago
- Sparser: Raw Filtering for Faster Analytics over Raw Data☆434Updated 7 years ago