stanford-futuredata / macrobaseLinks
MacroBase: A Search Engine for Fast Data
☆669Updated 2 years ago
Alternatives and similar repositories for macrobase
Users that are interested in macrobase are comparing it to the libraries listed below
Sorting:
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆659Updated 11 years ago
- An open-source, vendor-neutral data context service.☆160Updated 7 years ago
- Sparser: Raw Filtering for Faster Analytics over Raw Data☆433Updated 6 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆472Updated 8 years ago
- ☆459Updated 2 years ago
- Enabling queries on compressed data.☆280Updated last year
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,036Updated 2 years ago
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- Mirror of Apache Samoa (Incubating)☆249Updated 2 years ago
- Distributed, streaming anomaly detection and prediction with HTM in Apache Flink☆136Updated 7 years ago
- Generates more or less realistic log data for testing simple aggregation queries.☆259Updated last year
- ☆110Updated 8 years ago
- A scalable machine learning library on Apache Spark☆796Updated 3 years ago
- Distributed Prometheus time series database☆1,449Updated last week
- Mirror of Apache Apex core☆348Updated 4 years ago
- Fair job scheduler on Kubernetes and Mesos for batch workloads and Spark☆338Updated 2 years ago
- Interactive-Speed Analytics: 200x Faster, 200x Fewer Cluster Resources, Approximate Query Processing☆250Updated 4 years ago
- Streaming MapReduce with Scalding and Storm☆2,130Updated 3 years ago
- Streaming estimation of percentiles, especially high percentiles.☆63Updated 12 years ago
- An experimental hosted platform (GitHub-like) for organizing, managing, sharing, collaborating, and making sense of data.☆212Updated 7 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner)☆88Updated 9 years ago
- Transactional Distributed Database Layer☆59Updated 8 months ago
- Mirror of Apache Giraph☆617Updated 2 years ago
- An efficient updatable key-value store for Apache Spark☆251Updated 8 years ago
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...☆640Updated last year
- Self regulation and auto-tuning for distributed system☆65Updated 2 years ago
- A java library for stored queries☆376Updated 2 years ago
- BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data its…☆934Updated last year
- The Naiad system provides fast incremental and iterative computation for data-parallel workloads☆520Updated 3 years ago
- A collection of algorithms for mining data streams☆204Updated last year