umbrant / QuantileEstimationLinks
Streaming estimation of percentiles, especially high percentiles.
☆63Updated 12 years ago
Alternatives and similar repositories for QuantileEstimation
Users that are interested in QuantileEstimation are comparing it to the libraries listed below
Sorting:
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 9 years ago
- A collection of algorithms for mining data streams☆205Updated last year
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆660Updated 11 years ago
- Enabling queries on compressed data.☆281Updated last year
- Bitmap compression using the CONCISE algorithm☆43Updated 8 years ago
- Bloofi: A java implementation of multidimensional Bloom filters☆79Updated 2 months ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine☆167Updated 4 years ago
- Big Data Made Easy☆185Updated 7 years ago
- FlashX is a collection of big data analytics tools that perform data analytics in the form of graphs and matrices.☆234Updated 5 years ago
- ☆92Updated 9 years ago
- Java implementation of SAX, HOT-SAX, and EMMA☆83Updated last year
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆31Updated 7 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆117Updated 3 years ago
- A Java implementation of online kernel density estimation (oKDE)☆32Updated 7 years ago
- Probabilistic data structures for Guava.☆54Updated 4 years ago
- ☆54Updated 6 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆140Updated 8 years ago
- ☆110Updated 8 years ago
- Scalable Machine Learning in Scalding☆361Updated 7 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.☆427Updated 9 years ago
- A CPU and GPU-accelerated matrix library for data mining☆266Updated 4 years ago
- Distributed Matrix Library☆72Updated 8 years ago
- Scalable Graph Mining☆63Updated 2 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆474Updated 8 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 5 years ago
- Simulating the performance of various streaming algorithms. #experimentalmathematics☆59Updated 7 years ago
- GraphChi's Java version☆238Updated last year
- Secondary sort and streaming reduce for Apache Spark☆78Updated 2 years ago
- A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows.☆54Updated 9 years ago
- Low latency, strong consistency, fault tolerant distributed key value store. Colocate data and compute to achieve best performance cloud …☆114Updated 10 years ago