stanford-futuredata / macrobase
MacroBase: A Search Engine for Fast Data
☆666 Updated 2 years ago
Alternatives and similar repositories for macrobase:
Users who are interested in macrobase are comparing it to the libraries listed below.
- BlinkDB: Sub-Second Approximate Queries on Very Large Data. ☆659 Updated 11 years ago
- Simplifying robust end-to-end machine learning on Apache Spark. ☆470 Updated 8 years ago
- A scalable machine learning library on Apache Spark ☆793 Updated 3 years ago
- ☆460 Updated 2 years ago
- An open-source, vendor-neutral data context service. ☆159 Updated 7 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in… ☆1,037 Updated 2 years ago
- Distributed, streaming anomaly detection and prediction with HTM in Apache Flink ☆135 Updated 7 years ago
- ☆111 Updated 8 years ago
- MLDB is the Machine Learning Database ☆676 Updated 3 months ago
- Enabling queries on compressed data. ☆279 Updated last year
- The Naiad system provides fast incremental and iterative computation for data-parallel workloads ☆517 Updated 3 years ago
- Mirror of Apache Samoa (Incubating) ☆248 Updated 2 years ago
- BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data its… ☆932 Updated last year
- Sparser: Raw Filtering for Faster Analytics over Raw Data ☆432 Updated 6 years ago
- Distributed Prometheus time series database ☆1,440 Updated last week
- Implementations of the Portable Format for Analytics (PFA) ☆128 Updated 2 years ago
- Sparrow scheduling platform (U.C. Berkeley). ☆319 Updated 4 years ago
- Vectorized processing for Apache Arrow ☆485 Updated 3 years ago
- Mirror of Apache Apex core ☆348 Updated 3 years ago
- CPU and GPU-accelerated Machine Learning Library ☆913 Updated 2 years ago
- A platform for visualization and real-time monitoring of data workflows ☆1,172 Updated 5 years ago
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ... ☆639 Updated last year
- SQL-based streaming analytics platform at scale ☆1,224 Updated 4 years ago
- Fair job scheduler on Kubernetes and Mesos for batch workloads and Spark ☆338 Updated 2 years ago
- Breakout Detection via Robust E-Statistics ☆759 Updated 7 years ago
- Scripts to analyze Spark's performance ☆136 Updated 6 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine ☆168 Updated 4 years ago
- Distributed Neural Networks for Spark ☆604 Updated 4 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams. ☆425 Updated 9 years ago
- Parallel ML System - Bosen project ☆960 Updated last year