VDalibard / BOATLinks
☆54Updated 6 years ago
Alternatives and similar repositories for BOAT
Users that are interested in BOAT are comparing it to the libraries listed below
Sorting:
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 9 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆30Updated 7 years ago
- ☆46Updated 8 years ago
- communication-efficient distributed coordinate ascent☆91Updated 6 years ago
- A scala-based feature generation and modeling framework☆61Updated 7 years ago
- A platform for online learning that curtails data latency and saves you cost.☆47Updated 3 years ago
- A primal-dual framework for distributed L1-regularized optimization☆36Updated 9 years ago
- An implementation of DistBelief using the Akka Actor framework☆83Updated 9 years ago
- A CPU and GPU-accelerated matrix library for data mining☆266Updated 4 years ago
- The Musketeer workflow manager.☆41Updated 6 years ago
- Streaming estimation of percentiles, especially high percentiles.☆63Updated 12 years ago
- Streaming Parallel Decision Tree☆54Updated last year
- Sketching linear classifiers over data streams with the Weight-Median Sketch (SIGMOD 2018).☆39Updated 7 years ago
- ☆110Updated 8 years ago
- Splash Project for parallel stochastic learning☆94Updated 8 years ago
- A Java Toolbox for Scalable Probabilistic Machine Learning☆123Updated 2 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆117Updated 3 years ago
- MacroBase: A Search Engine for Fast Data☆669Updated 2 years ago
- Graphulo: Accumulo library of matrix math primitives and graph algorithms☆79Updated last month
- Scalable Machine Learning in Scalding☆361Updated 7 years ago
- Scalable Graph Mining☆63Updated 2 years ago
- FlashX is a collection of big data analytics tools that perform data analytics in the form of graphs and matrices.☆234Updated 5 years ago
- A simple demonstration of sub-sequence sampling as used for anomaly detection with EKG signals☆103Updated 4 years ago
- Scalable and Sustainable Deep Learning via Randomized Hashing☆94Updated 3 years ago
- CS294 RISE Course Material☆32Updated 6 years ago
- ☆57Updated 8 years ago
- A Profiler for Identifying the Major Sources of Performance Variance in Modern Applications☆95Updated 7 years ago
- Fine-Grained Distributed Computing☆11Updated 9 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆15Updated 5 years ago
- Automatic offload of user-written Spark kernels to accelerators☆18Updated 8 years ago