stanford-futuredata / macrobase
MacroBase: A Search Engine for Fast Data
☆663Updated 2 years ago
Alternatives and similar repositories for macrobase:
Users that are interested in macrobase are comparing it to the libraries listed below
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆660Updated 10 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆470Updated 7 years ago
- An open-source, vendor-neutral data context service.☆159Updated 6 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,038Updated 2 years ago
- Distributed Prometheus time series database☆1,430Updated this week
- ☆459Updated last year
- Enabling queries on compressed data.☆278Updated last year
- Vectorized processing for Apache Arrow☆484Updated 2 years ago
- Sparser: Raw Filtering for Faster Analytics over Raw Data☆432Updated 6 years ago
- Mirror of Apache Samoa (Incubating)☆248Updated last year
- Streaming MapReduce with Scalding and Storm☆2,135Updated 2 years ago
- The Naiad system provides fast incremental and iterative computation for data-parallel workloads☆515Updated 2 years ago
- A scalable machine learning library on Apache Spark☆792Updated 3 years ago
- Distributed, streaming anomaly detection and prediction with HTM in Apache Flink☆136Updated 7 years ago
- Distributed Neural Networks for Spark☆604Updated 4 years ago
- Mirror of Apache Apex core☆349Updated 3 years ago
- Generates more or less realistic log data for testing simple aggregation queries.☆257Updated last year
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...☆635Updated last year
- An experimental hosted platform (GitHub-like) for organizing, managing, sharing, collaborating, and making sense of data.☆211Updated 6 years ago
- ☆110Updated 7 years ago
- BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data its…☆927Updated last year
- Mirror of Apache MADlib☆465Updated 8 months ago
- An efficient updatable key-value store for Apache Spark☆250Updated 7 years ago
- Scripts to analyze Spark's performance☆136Updated 6 years ago
- An open source ML system for the end-to-end data science lifecycle☆1,039Updated this week
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆901Updated this week
- A java library for stored queries☆375Updated last year