stanford-futuredata / macrobaseLinks
MacroBase: A Search Engine for Fast Data
☆668Updated 2 years ago
Alternatives and similar repositories for macrobase
Users that are interested in macrobase are comparing it to the libraries listed below
Sorting:
- Simplifying robust end-to-end machine learning on Apache Spark.☆472Updated 8 years ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆659Updated 11 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,037Updated 2 years ago
- An open-source, vendor-neutral data context service.☆160Updated 7 years ago
- ☆460Updated 2 years ago
- Enabling queries on compressed data.☆279Updated last year
- ☆110Updated 8 years ago
- A scalable machine learning library on Apache Spark☆796Updated 3 years ago
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- Distributed Prometheus time series database☆1,444Updated this week
- MLDB is the Machine Learning Database☆676Updated 5 months ago
- Sparser: Raw Filtering for Faster Analytics over Raw Data☆433Updated 6 years ago
- Mirror of Apache Samoa (Incubating)☆249Updated 2 years ago
- Streaming MapReduce with Scalding and Storm☆2,132Updated 3 years ago
- Distributed Neural Networks for Spark☆604Updated 4 years ago
- Distributed, streaming anomaly detection and prediction with HTM in Apache Flink☆136Updated 7 years ago
- The Heroic Time Series Database☆846Updated 4 years ago
- Generates more or less realistic log data for testing simple aggregation queries.☆259Updated last year
- Mirror of Apache Apex core☆348Updated 4 years ago
- Breakout Detection via Robust E-Statistics☆759Updated 7 years ago
- The Naiad system provides fast incremental and iterative computation for data-parallel workloads☆518Updated 3 years ago
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...☆640Updated last year
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.☆857Updated 4 years ago
- Iceberg is a table format for large, slow-moving tabular data☆481Updated 2 years ago
- CPU and GPU-accelerated Machine Learning Library☆914Updated 2 years ago
- FlashX is a collection of big data analytics tools that perform data analytics in the form of graphs and matrices.☆233Updated 5 years ago
- A library for time series analysis on Apache Spark☆1,193Updated 4 years ago
- A java library for stored queries☆376Updated 2 years ago
- Berkeley Tree Database (BTrDB) server☆910Updated 3 years ago
- ☆92Updated 9 years ago