stanford-futuredata/macrobase

MacroBase: A Search Engine for Fast Data

☆671

Alternatives and similar repositories for macrobase

Users who are interested in macrobase are comparing it to the repositories listed below.

stanford-futuredata / readinggroup
☆46 • Aug 28, 2017 • Updated 8 years ago
stanford-futuredata / ASAP
ASAP: Prioritizing Attention via Time Series Smoothing
☆197 • Apr 5, 2018 • Updated 8 years ago
weld-project / weld
High-performance runtime for data analytics applications
☆3,005 • Apr 13, 2026 • Updated 3 months ago
stanford-futuredata / sparser
Sparser: Raw Filtering for Faster Analytics over Raw Data
☆432 • Sep 18, 2018 • Updated 7 years ago
stanford-futuredata / noscope
Accelerating network inference over video
☆437 • Mar 6, 2020 • Updated 6 years ago
filodb / FiloDB
Distributed Prometheus time series database
☆1,468 • Updated this week
amplab / keystone
Simplifying robust end-to-end machine learning on Apache Spark.
☆473 • Apr 18, 2017 • Updated 9 years ago
TIBCOSoftware / snappydata
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…
☆1,032 • Nov 21, 2022 • Updated 3 years ago
cmu-db / peloton
The Self-Driving Database Management System
☆2,049 • May 15, 2019 • Updated 7 years ago
apache / incubator-heron
Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter
☆3,629 • Mar 1, 2023 • Updated 3 years ago
stanford-futuredata / msketch
Moments Sketch Code
☆41 • Oct 31, 2018 • Updated 7 years ago
facebookarchive / beringei
Beringei is a high performance, in-memory storage engine for time series data.
☆3,155 • Jul 11, 2018 • Updated 8 years ago
sameeragarwal / blinkdb
BlinkDB: Sub-Second Approximate Queries on Very Large Data.
☆660 • Feb 6, 2014 • Updated 12 years ago
onetapbeyond / opencpu-spark-executor
Apache Spark OpenCPU Executor (ROSE)
☆25 • Jun 16, 2018 • Updated 8 years ago
amplab / succinct
Enabling queries on compressed data.
☆282 • Dec 16, 2023 • Updated 2 years ago
snorkel-team / snorkel
A system for quickly generating training data with weak supervision
☆5,992 • Jun 8, 2026 • Updated last month
databricks / tensorframes
[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
☆744 • Jul 30, 2024 • Updated last year
gearpump / gearpump
Lightweight real-time big data streaming engine over Akka
☆756 • Jul 14, 2026 • Updated last week
BIDData / BIDMach
CPU and GPU-accelerated Machine Learning Library
☆919 • Oct 4, 2022 • Updated 3 years ago
apache / pinot
Apache Pinot - A realtime distributed OLAP datastore
☆6,117 • Updated this week
lemire / bloofi
Bloofi: A java implementation of multidimensional Bloom filters
☆86 • Jul 1, 2025 • Updated last year
ground-context / ground
An open-source, vendor-neutral data context service.
☆163 • Mar 6, 2018 • Updated 8 years ago
addthis / stream-lib
Stream summarizer and cardinality estimator.
☆2,265 • Nov 28, 2019 • Updated 6 years ago
twitter-archive / distributedlog
A high performance replicated log service. (The development is moved to Apache Incubator)
☆2,206 • Feb 25, 2020 • Updated 6 years ago
amplab / velox-modelserver
☆110 • Apr 17, 2017 • Updated 9 years ago
Netflix / Surus
☆462 • Mar 24, 2023 • Updated 3 years ago
yahoo / egads
A Java package to automatically detect anomalies in large scale time-series data
☆1,190 • Nov 14, 2023 • Updated 2 years ago
airbnb / aerosolve
A machine learning package built for humans.
☆4,809 • Nov 6, 2025 • Updated 8 months ago
sryza / spark-timeseries
A library for time series analysis on Apache Spark
☆1,197 • Oct 13, 2020 • Updated 5 years ago
huawei-noah / streamDM
Stream Data Mining Library for Spark Streaming
☆497 • Apr 16, 2023 • Updated 3 years ago
ucbrise / clipper
A low-latency prediction-serving system
☆1,421 • Apr 26, 2021 • Updated 5 years ago
heavyai / heavydb
HeavyDB (formerly MapD/OmniSciDB)
☆3,055 • Jun 25, 2026 • Updated 3 weeks ago
bellettif / sparkGeoTS
☆12 • Apr 8, 2016 • Updated 10 years ago
madlib / archived_madlib
MADlib has moved to Apache MADlib (incubating). Please send pull requests to the Apache repository.
☆508 • Feb 9, 2018 • Updated 8 years ago
CorfuDB / CorfuDB
A cluster consistency platform
☆666 • Updated this week
HazyResearch / deepdive
DeepDive
☆1,979 • Jun 9, 2022 • Updated 4 years ago
h2oai / h2o-2
Please visit https://github.com/h2oai/h2o-3 for latest H2O
☆2,254 • Oct 24, 2024 • Updated last year
ottogroup / SPQR
Spooker is a dynamic framework for processing high volume data streams via processing pipelines
☆30 • Feb 1, 2016 • Updated 10 years ago
pinterest / terrapin
Serving system for batch generated data sets
☆179 • May 11, 2017 • Updated 9 years ago