umbrant / QuantileEstimationLinks
Streaming estimation of percentiles, especially high percentiles.
☆63Updated 12 years ago
Alternatives and similar repositories for QuantileEstimation
Users that are interested in QuantileEstimation are comparing it to the libraries listed below
Sorting:
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 9 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆116Updated 3 years ago
- Probabilistic data structures for Guava.☆54Updated 4 years ago
- Simulating the performance of various streaming algorithms. #experimentalmathematics☆59Updated 7 years ago
- Bitmap compression using the CONCISE algorithm☆43Updated 8 years ago
- Bloofi: A java implementation of multidimensional Bloom filters☆79Updated 2 weeks ago
- A collection of algorithms for mining data streams☆204Updated last year
- ☆92Updated 9 years ago
- Big Data Made Easy☆185Updated 7 years ago
- Enabling queries on compressed data.☆280Updated last year
- Low latency, strong consistency, fault tolerant distributed key value store. Colocate data and compute to achieve best performance cloud …☆114Updated 10 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine☆167Updated 4 years ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆659Updated 11 years ago
- Distributed, streaming anomaly detection and prediction with HTM in Apache Flink☆136Updated 7 years ago
- Set of real time stream processing algorithms that can be used by big data streaming platform☆72Updated this week
- ☆110Updated 8 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆140Updated 8 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.☆425Updated 9 years ago
- Persistent Adaptive Radix Trees in Java☆82Updated 4 years ago
- FlashX is a collection of big data analytics tools that perform data analytics in the form of graphs and matrices.☆233Updated 5 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 5 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Updated 8 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆15Updated 4 years ago
- ☆52Updated 6 years ago
- Cantor provides utilities for estimating the cardinality of large sets.☆83Updated 3 years ago
- A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows.☆54Updated 9 years ago
- Streaming Parallel Decision Tree☆54Updated last year
- Git mirror for the FastBit library Subversion repository.☆71Updated 8 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆31Updated 7 years ago
- Graphulo: Accumulo library of matrix math primitives and graph algorithms☆78Updated last year