VDalibard / BOAT
☆52Updated 6 years ago
Alternatives and similar repositories for BOAT:
Users that are interested in BOAT are comparing it to the libraries listed below
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 8 years ago
- The Musketeer workflow manager.☆41Updated 6 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆31Updated 6 years ago
- ☆46Updated 7 years ago
- Streaming estimation of percentiles, especially high percentiles.☆63Updated 12 years ago
- Your worst case is our best case.☆137Updated 8 years ago
- FlashX is a collection of big data analytics tools that perform data analytics in the form of graphs and matrices.☆233Updated 5 years ago
- Secondary index on HBase☆18Updated 9 years ago
- communication-efficient distributed coordinate ascent☆91Updated 6 years ago
- Fast I/O plugins for Spark☆41Updated 4 years ago
- Panorama: Capturing and Enhancing In Situ System Observability for Failure Detection☆117Updated 4 years ago
- Graphulo: Accumulo library of matrix math primitives and graph algorithms☆78Updated last year
- An experimental distributed execution engine☆22Updated 4 years ago
- Sketching linear classifiers over data streams with the Weight-Median Sketch (SIGMOD 2018).☆38Updated 6 years ago
- Fine-Grained Distributed Computing☆11Updated 9 years ago
- Bloofi: A java implementation of multidimensional Bloom filters☆79Updated 9 years ago
- Spark MLlib code optimized to efficiently support sparse data☆51Updated 8 years ago
- Single-threaded graph computation in Rust☆248Updated 6 years ago
- ☆27Updated 8 years ago
- Implements the Karnin-Lang-Liberty (KLL) algorithm in python☆54Updated 2 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆113Updated 3 years ago
- S-Store Transactional Streaming Data Management System☆22Updated 4 years ago
- Persistent Adaptive Radix Trees in Java☆81Updated 4 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.☆425Updated 9 years ago
- Apache Quickstep Incubator - This project is retired☆95Updated 6 years ago
- Behrooz File System (BFS)☆54Updated 9 years ago
- Distributed Matrix Library☆71Updated 8 years ago
- !!!!!DEPRECATED!!!! distributed machine learning benchmark - a public benchmark of distributed ML solvers and frameworks☆40Updated 6 years ago
- Simplified Moment Sketch Implemntation☆36Updated 6 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago