dgryski / go-fastquantiles
approximate streaming quantiles
☆31Updated 11 years ago
Alternatives and similar repositories for go-fastquantiles
Users who are interested in go-fastquantiles are comparing it to the libraries listed below.
- Reduce your data. A unix filter for algebird-powered aggregation.☆140Updated 8 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 9 years ago
- Probabilistic Multiplicity Counting☆49Updated 10 years ago
- Streaming estimation of percentiles, especially high percentiles.☆63Updated 12 years ago
- Streaming Parallel Decision Tree☆54Updated last year
- A simple demonstration of sub-sequence sampling as used for anomaly detection with EKG signals☆103Updated 4 years ago
- A multi-armed bandit to A/B test Go projects, or other languages via an API.☆71Updated 11 years ago
- CUDA kernel and JNI code which is called by Apache Spark's MLlib.☆19Updated 9 years ago
- Simulating the performance of various streaming algorithms. #experimentalmathematics☆59Updated 7 years ago
- An implementation of HLL++ in Go☆70Updated 6 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆117Updated 3 years ago
- A scala-based feature generation and modeling framework☆61Updated 7 years ago
- ReactiveLDA is a fast, lightweight implementation of the Latent Dirichlet Allocation (LDA) algorithm, using a parallel vanilla Gibbs samp…☆61Updated 10 years ago
- A set of distinct value estimators that give probabilistic bounds on a set's cardinality☆22Updated 5 years ago
- Apache Spark jobs such as Principal Coordinate Analysis.☆75Updated 8 years ago
- Timberlake is a Job Tracker for Hadoop.☆177Updated 5 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 10 years ago
- Benchmarks of BLAS libraries with Scala interface☆30Updated 9 years ago
- Scala client for the Lightning data visualization server (WIP)☆47Updated 6 years ago
- ☆92Updated 9 years ago
- A locality-sensitive hashing library☆46Updated 11 years ago
- Last-seen sketch implementation in Go☆16Updated 4 years ago
- gk: streaming quantiles☆45Updated 3 years ago
- Compressing and Decoding Term Statistics Time Series -- ECIR 2016☆10Updated 9 years ago
- Distributed Matrix Library☆72Updated 8 years ago
- An Apache Spark-shell backend for IPython☆105Updated 4 years ago
- Probabilistic data structures for processing very large datasets (MinHash, HyperLogLog)☆11Updated 10 years ago
- Spark library for doing exploratory data analysis in a scalable way☆44Updated 9 years ago
- Splash Project for parallel stochastic learning☆94Updated 8 years ago
- Pig on Apache Spark☆82Updated 10 years ago