umbrant / QuantileEstimation
Streaming estimation of percentiles, especially high percentiles.
☆63 · Updated 13 years ago
Alternatives and similar repositories for QuantileEstimation
Users who are interested in QuantileEstimation compare it to the libraries listed below.
- BlinkDB: Sub-Second Approximate Queries on Very Large Data. ☆659 · Updated 11 years ago
- A collection of algorithms for mining data streams. ☆205 · Updated 2 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams. ☆63 · Updated 9 years ago
- Enabling queries on compressed data. ☆282 · Updated 2 years ago
- ☆53 · Updated 6 years ago
- ☆92 · Updated 10 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams. ☆428 · Updated 9 years ago
- An implementation of locality sensitive hashing with Hadoop. ☆57 · Updated 10 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra. ☆116 · Updated 4 years ago
- FlashX is a collection of big data analytics tools that perform data analytics in the form of graphs and matrices. ☆237 · Updated 5 years ago
- Distributed Matrix Library. ☆72 · Updated 9 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark. ☆30 · Updated 7 years ago
- ☆110 · Updated 8 years ago
- Big Data Made Easy. ☆185 · Updated 8 years ago
- MacroBase: A Search Engine for Fast Data. ☆671 · Updated 3 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine. ☆168 · Updated 4 years ago
- Simulating the performance of various streaming algorithms. #experimentalmathematics ☆58 · Updated 8 years ago
- Bitmap compression using the CONCISE algorithm. ☆43 · Updated 8 years ago
- Simplifying robust end-to-end machine learning on Apache Spark. ☆474 · Updated 8 years ago
- Splash Project for parallel stochastic learning. ☆93 · Updated 8 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner). ☆88 · Updated 9 years ago
- Real²time Exploratory Analytics on Large Datasets. ☆121 · Updated 6 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix). ☆16 · Updated 5 years ago
- MADlib has moved to Apache MADlib (incubating). Please send pull requests to the Apache repository. ☆508 · Updated 7 years ago
- ☆460 · Updated 2 years ago
- Drizzle integration with Apache Spark. ☆120 · Updated 7 years ago
- Distributed, streaming anomaly detection and prediction with HTM in Apache Flink. ☆136 · Updated 8 years ago
- An efficient updatable key-value store for Apache Spark. ☆254 · Updated 8 years ago
- A CPU and GPU-accelerated matrix library for data mining. ☆267 · Updated 4 years ago
- Reduce your data. A unix filter for algebird-powered aggregation. ☆141 · Updated 8 years ago