stanford-futuredata / macrobaseView external linksLinks
MacroBase: A Search Engine for Fast Data
☆672Dec 14, 2022Updated 3 years ago
Alternatives and similar repositories for macrobase
Users that are interested in macrobase are comparing it to the libraries listed below
Sorting:
- ☆46Aug 28, 2017Updated 8 years ago
- High-performance runtime for data analytics applications☆3,007Jun 22, 2022Updated 3 years ago
- ASAP: Prioritizing Attention via Time Series Smoothing☆197Apr 5, 2018Updated 7 years ago
- Distributed Prometheus time series database☆1,464Updated this week
- Accelerating network inference over video☆436Mar 6, 2020Updated 5 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,034Nov 21, 2022Updated 3 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆476Apr 18, 2017Updated 8 years ago
- Sparser: Raw Filtering for Faster Analytics over Raw Data☆433Sep 18, 2018Updated 7 years ago
- The Self-Driving Database Management System☆2,049May 15, 2019Updated 6 years ago
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter☆3,700Mar 1, 2023Updated 2 years ago
- Enabling queries on compressed data.☆282Dec 16, 2023Updated 2 years ago
- Beringei is a high performance, in-memory storage engine for time series data.☆3,170Jul 11, 2018Updated 7 years ago
- Lightweight real-time big data streaming engine over Akka☆758Mar 1, 2022Updated 3 years ago
- Apache Spark OpenCPU Executor (ROSE)☆26Jun 16, 2018Updated 7 years ago
- A system for quickly generating training data with weak supervision☆5,938May 2, 2024Updated last year
- Bloofi: A java implementation of multidimensional Bloom filters☆85Jul 1, 2025Updated 7 months ago
- Apache Pinot - A realtime distributed OLAP datastore☆6,024Feb 9, 2026Updated last week
- An open-source, vendor-neutral data context service.☆161Mar 6, 2018Updated 7 years ago
- Advanced Bloom Filter Based Algorithms for Efficient Approximate Data De-Duplication in Streams☆246Mar 5, 2017Updated 8 years ago
- ☆110Apr 17, 2017Updated 8 years ago
- A machine learning package built for humans.☆4,799Nov 6, 2025Updated 3 months ago
- Moments Sketch Code☆40Oct 31, 2018Updated 7 years ago
- CPU and GPU-accelerated Machine Learning Library☆920Oct 4, 2022Updated 3 years ago
- A library for time series analysis on Apache Spark☆1,195Oct 13, 2020Updated 5 years ago
- Serverless proxy for Spark cluster☆324Oct 29, 2020Updated 5 years ago
- Quark is a data virtualization engine over analytic databases.☆100Jul 13, 2017Updated 8 years ago
- A high performance replicated log service. (The development is moved to Apache Incubator)☆2,208Feb 25, 2020Updated 5 years ago
- Immutable DataTable implementation in Scala☆71Dec 30, 2019Updated 6 years ago
- Stream Data Mining Library for Spark Streaming☆500Apr 16, 2023Updated 2 years ago
- Serving system for batch generated data sets☆177May 11, 2017Updated 8 years ago
- HeavyDB (formerly MapD/OmniSciDB)☆3,057Jan 6, 2026Updated last month
- A language and runtime for distributed, incremental data processing in the cloud☆974Oct 18, 2023Updated 2 years ago
- ☆461Mar 24, 2023Updated 2 years ago
- Visualizations for machine learning datasets☆7,370May 24, 2023Updated 2 years ago
- A cluster consistency platform☆661Feb 3, 2026Updated last week
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29May 15, 2020Updated 5 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆30Feb 1, 2016Updated 10 years ago
- A Java package to automatically detect anomalies in large scale time-series data☆1,189Nov 14, 2023Updated 2 years ago
- SQL-based streaming analytics platform at scale☆1,226Jun 21, 2020Updated 5 years ago