stanford-futuredata / macrobase
MacroBase: A Search Engine for Fast Data
☆671 Updated 2 years ago
Alternatives and similar repositories for macrobase
Users who are interested in macrobase are comparing it to the libraries listed below.
- BlinkDB: Sub-Second Approximate Queries on Very Large Data. ☆660 Updated 11 years ago
- ☆460 Updated 2 years ago
- Simplifying robust end-to-end machine learning on Apache Spark. ☆473 Updated 8 years ago
- An open-source, vendor-neutral data context service. ☆160 Updated 7 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in… ☆1,036 Updated 2 years ago
- Sparser: Raw Filtering for Faster Analytics over Raw Data ☆433 Updated 7 years ago
- Enabling queries on compressed data. ☆281 Updated last year
- Mirror of Apache Samoa (Incubating) ☆250 Updated 2 years ago
- A scalable machine learning library on Apache Spark ☆795 Updated 4 years ago
- Generates more or less realistic log data for testing simple aggregation queries. ☆261 Updated last year
- Vectorized processing for Apache Arrow ☆485 Updated 3 years ago
- Distributed, streaming anomaly detection and prediction with HTM in Apache Flink ☆136 Updated 8 years ago
- Distributed Prometheus time series database ☆1,457 Updated this week
- BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data its… ☆938 Updated 2 years ago
- ☆110 Updated 8 years ago
- ☆46 Updated 8 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark ☆30 Updated 7 years ago
- Fair job scheduler on Kubernetes and Mesos for batch workloads and Spark ☆337 Updated 2 years ago
- MLDB is the Machine Learning Database ☆681 Updated 9 months ago
- Breakout Detection via Robust E-Statistics ☆761 Updated 8 years ago
- REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models ☆587 Updated last year
- Mirror of Apache Apex core ☆350 Updated 4 years ago
- The Heroic Time Series Database ☆846 Updated 4 years ago
- Berkeley Tree Database (BTrDB) server ☆910 Updated 4 years ago
- ☆92 Updated 10 years ago
- Distributed Neural Networks for Spark ☆605 Updated 5 years ago
- Streaming estimation of percentiles, especially high percentiles. ☆63 Updated 13 years ago
- The Naiad system provides fast incremental and iterative computation for data-parallel workloads ☆522 Updated 3 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams. ☆427 Updated 9 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner) ☆88 Updated 9 years ago