stanford-futuredata / macrobase
MacroBase: A Search Engine for Fast Data
☆660Updated last year
Related projects: ⓘ
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆660Updated 10 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,039Updated last year
- Simplifying robust end-to-end machine learning on Apache Spark.☆468Updated 7 years ago
- ☆458Updated last year
- Distributed Prometheus time series database☆1,428Updated this week
- Enabling queries on compressed data.☆276Updated 9 months ago
- A scalable machine learning library on Apache Spark☆793Updated 3 years ago
- An open-source, vendor-neutral data context service.☆158Updated 6 years ago
- Vectorized processing for Apache Arrow☆486Updated 2 years ago
- Mirror of Apache Samoa (Incubating)☆246Updated last year
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...☆625Updated 9 months ago
- The Naiad system provides fast incremental and iterative computation for data-parallel workloads☆515Updated 2 years ago
- Sparser: Raw Filtering for Faster Analytics over Raw Data☆430Updated 6 years ago
- Streaming MapReduce with Scalding and Storm☆2,139Updated 2 years ago
- BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data its…☆921Updated 10 months ago
- [DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark☆749Updated last month
- Distributed Neural Networks for Spark☆603Updated 4 years ago
- ☆881Updated this week
- MLDB is the Machine Learning Database☆664Updated last year
- An experimental hosted platform (GitHub-like) for organizing, managing, sharing, collaborating, and making sense of data.☆210Updated 6 years ago
- ☆334Updated this week
- Sparkling Water provides H2O functionality inside Spark cluster☆961Updated 2 months ago
- SQL-based streaming analytics platform at scale☆1,222Updated 4 years ago
- Mirror of Apache Apex core☆350Updated 3 years ago
- ☆399Updated this week
- Generates more or less realistic log data for testing simple aggregation queries.☆257Updated 9 months ago
- CPU and GPU-accelerated Machine Learning Library☆915Updated last year
- Distributed, streaming anomaly detection and prediction with HTM in Apache Flink☆136Updated 7 years ago
- A library for time series analysis on Apache Spark☆1,191Updated 3 years ago
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆888Updated this week