MacroBase: A Search Engine for Fast Data
☆671 · Dec 14, 2022 · Updated 3 years ago
Alternatives and similar repositories for macrobase
Users that are interested in macrobase are comparing it to the libraries listed below.
- ☆46 · Aug 28, 2017 · Updated 8 years ago
- ASAP: Prioritizing Attention via Time Series Smoothing ☆197 · Apr 5, 2018 · Updated 8 years ago
- High-performance runtime for data analytics applications ☆3,004 · Apr 13, 2026 · Updated 2 months ago
- Sparser: Raw Filtering for Faster Analytics over Raw Data ☆432 · Sep 18, 2018 · Updated 7 years ago
- Distributed Prometheus time series database ☆1,463 · Jun 25, 2026 · Updated last week
- Simplifying robust end-to-end machine learning on Apache Spark. ☆473 · Apr 18, 2017 · Updated 9 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in… ☆1,032 · Nov 21, 2022 · Updated 3 years ago
- The Self-Driving Database Management System ☆2,051 · May 15, 2019 · Updated 7 years ago
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter ☆3,630 · Mar 1, 2023 · Updated 3 years ago
- Moments Sketch Code ☆41 · Oct 31, 2018 · Updated 7 years ago
- Beringei is a high performance, in-memory storage engine for time series data. ☆3,154 · Jul 11, 2018 · Updated 7 years ago
- Enabling queries on compressed data. ☆282 · Dec 16, 2023 · Updated 2 years ago
- Apache Spark OpenCPU Executor (ROSE) ☆25 · Jun 16, 2018 · Updated 8 years ago
- A system for quickly generating training data with weak supervision ☆5,982 · Jun 8, 2026 · Updated 3 weeks ago
- An open-source, vendor-neutral data context service. ☆162 · Mar 6, 2018 · Updated 8 years ago
- Lightweight real-time big data streaming engine over Akka ☆757 · Updated this week
- Apache Pinot - A realtime distributed OLAP datastore ☆6,100 · Updated this week
- Advanced Bloom Filter Based Algorithms for Efficient Approximate Data De-Duplication in Streams ☆245 · Mar 5, 2017 · Updated 9 years ago
- CPU and GPU-accelerated Machine Learning Library ☆919 · Oct 4, 2022 · Updated 3 years ago
- A high performance replicated log service. (The development is moved to Apache Incubator) ☆2,207 · Feb 25, 2020 · Updated 6 years ago
- Bloofi: A java implementation of multidimensional Bloom filters ☆86 · Jul 1, 2025 · Updated last year
- ☆110 · Apr 17, 2017 · Updated 9 years ago
- ☆462 · Mar 24, 2023 · Updated 3 years ago
- A library for time series analysis on Apache Spark ☆1,197 · Oct 13, 2020 · Updated 5 years ago
- Stream Data Mining Library for Spark Streaming ☆497 · Apr 16, 2023 · Updated 3 years ago
- A machine learning package built for humans. ☆4,804 · Nov 6, 2025 · Updated 7 months ago
- A Java package to automatically detect anomalies in large scale time-series data ☆1,189 · Nov 14, 2023 · Updated 2 years ago
- HeavyDB (formerly MapD/OmniSciDB) ☆3,056 · Updated this week
- A cluster consistency platform ☆666 · Jun 19, 2026 · Updated last week
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines ☆30 · Feb 1, 2016 · Updated 10 years ago
- A low-latency prediction-serving system ☆1,422 · Apr 26, 2021 · Updated 5 years ago
- ☆12 · Apr 8, 2016 · Updated 10 years ago
- Serving system for batch generated data sets ☆179 · May 11, 2017 · Updated 9 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌 ☆28 · May 15, 2020 · Updated 6 years ago
- Immutable DataTable implementation in Scala ☆70 · Dec 30, 2019 · Updated 6 years ago
- DeepDive ☆1,979 · Jun 9, 2022 · Updated 4 years ago
- Quark is a data virtualization engine over analytic databases. ☆101 · Jul 13, 2017 · Updated 8 years ago
- BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data its… ☆940 · Nov 19, 2023 · Updated 2 years ago
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3) ☆96 · Apr 4, 2019 · Updated 7 years ago