umbrant / QuantileEstimation
Streaming estimation of percentiles, especially high percentiles.
☆63Updated 12 years ago
Alternatives and similar repositories for QuantileEstimation:
Users that are interested in QuantileEstimation are comparing it to the libraries listed below
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 8 years ago
- Simulating the performance of various streaming algorithms. #experimentalmathematics☆59Updated 7 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆31Updated 6 years ago
- Bloofi: A java implementation of multidimensional Bloom filters☆79Updated 9 years ago
- A collection of algorithms for mining data streams☆203Updated last year
- Bitmap compression using the CONCISE algorithm☆43Updated 8 years ago
- Probabilistic data structures for Guava.☆54Updated 4 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆113Updated 3 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 5 years ago
- Persistent Adaptive Radix Trees in Java☆81Updated 4 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Updated 8 years ago
- Streaming Parallel Decision Tree☆54Updated last year
- Probabilistic data structures server. The data model is key-value, where values are: Bloomfilters, LinearCounters, HyperLogLogs, CountMin…☆25Updated 9 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- An implementation of locality sensitive hashing with Hadoop☆57Updated 10 years ago
- Enabling queries on compressed data.☆278Updated last year
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆659Updated 11 years ago
- Distributed Matrix Library☆71Updated 8 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆15Updated 4 years ago
- Low latency, strong consistency, fault tolerant distributed key value store. Colocate data and compute to achieve best performance cloud …☆114Updated 9 years ago
- ☆52Updated 6 years ago
- ☆111Updated 8 years ago
- Git mirror for the FastBit library Subversion repository.☆71Updated 8 years ago
- Real²time Exploratory Analytics on Large Datasets☆122Updated 5 years ago
- Schema and type system for creating sortable byte[]☆46Updated 12 years ago
- ☆92Updated 9 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆138Updated 8 years ago
- A Distributed Matrix Operations Library Built on Top of Spark☆106Updated 8 years ago
- Behrooz File System (BFS)☆54Updated 9 years ago
- Scala stuff☆18Updated 5 years ago