stanford-futuredata/sparser

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/stanford-futuredata/sparser)

stanford-futuredata / sparser

Sparser: Raw Filtering for Faster Analytics over Raw Data

☆432

Alternatives and similar repositories for sparser

Users that are interested in sparser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

stanford-futuredata / macrobase
View on GitHub
MacroBase: A Search Engine for Fast Data
☆671Dec 14, 2022Updated 3 years ago
guillaumebort / mison
View on GitHub
Scala Mison implementation
☆15Nov 16, 2018Updated 7 years ago
stanford-futuredata / msketch
View on GitHub
Moments Sketch Code
☆41Oct 31, 2018Updated 7 years ago
hydro-project / fluent
View on GitHub
A data-driven compute platform
☆1,212Aug 9, 2019Updated 6 years ago
mklarqvist / StormBitmaps
View on GitHub
Fast algorithms for computing XX^T for binary matrices
☆14Sep 24, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
apache / datasketches-memory
View on GitHub
High performance native memory access for Java.
☆134Jul 13, 2026Updated last week
weld-project / weld
View on GitHub
High-performance runtime for data analytics applications
☆3,005Apr 13, 2026Updated 3 months ago
kjmrknsn / livy-manager
View on GitHub
Livy Manager - Web UI for Managing Apache Livy Sessions
☆16Dec 7, 2017Updated 8 years ago
dremio / gandiva
View on GitHub
Vectorized processing for Apache Arrow
☆484Feb 14, 2022Updated 4 years ago
facebookarchive / LogDevice
View on GitHub
Distributed storage for sequential data
☆1,904Oct 12, 2021Updated 4 years ago
hortonworks-spark / spark-schema-registry
View on GitHub
Schema Registry integration for Apache Spark
☆40Nov 16, 2022Updated 3 years ago
facebookarchive / beringei
View on GitHub
Beringei is a high performance, in-memory storage engine for time series data.
☆3,155Jul 11, 2018Updated 8 years ago
richardstartin / multi-matcher
View on GitHub
simple rules engine
☆93Apr 16, 2020Updated 6 years ago
hvanhovell / weld-java
View on GitHub
JVM integration for Weld
☆16Sep 24, 2018Updated 7 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
pnowojski / simd-blog
View on GitHub
Source code for SIMD benchmarks and experiments in Java
☆32Jun 30, 2017Updated 9 years ago
TIBCOSoftware / snappydata
View on GitHub
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…
☆1,032Nov 21, 2022Updated 3 years ago
pravega / pravega
View on GitHub
Pravega - Streaming as a new software defined storage primitive
☆1,998Mar 2, 2025Updated last year
apache / hawq
View on GitHub
Apache HAWQ
☆696May 16, 2024Updated 2 years ago
manuelbernhardt / akka-locality
View on GitHub
Akka extensions for exploiting locality of clustered actors
☆10Nov 14, 2019Updated 6 years ago
liquidm / druid-dumbo
View on GitHub
☆21Mar 17, 2023Updated 3 years ago
lemire / BitSliceIndex
View on GitHub
Experiments on bit-slice indexing
☆13Feb 9, 2015Updated 11 years ago
cmu-db / peloton
View on GitHub
The Self-Driving Database Management System
☆2,049May 15, 2019Updated 7 years ago
apache / incubator-retired-gearpump
View on GitHub
Mirror of Apache Gearpump (Incubating)
☆297Aug 27, 2018Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
TimelyDataflow / timely-dataflow
View on GitHub
A modular implementation of timely dataflow in Rust
☆3,629Jul 14, 2026Updated last week
oracle / graphpipe
View on GitHub
Machine Learning Model Deployment Made Simple
☆714Oct 16, 2018Updated 7 years ago
DataDog / sketches-java
View on GitHub
DDSketch: A Fast and Fully-Mergeable Quantile Sketch with Relative-Error Guarantees.
☆132Apr 26, 2026Updated 2 months ago
FeatureBaseDB / featurebase
View on GitHub
A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc…
☆2,525Feb 21, 2024Updated 2 years ago
Netflix / iceberg
View on GitHub
Iceberg is a table format for large, slow-moving tabular data
☆494Apr 10, 2023Updated 3 years ago
apache / kudu
View on GitHub
Mirror of Apache Kudu
☆1,904Updated this week
efficient / SuRF
View on GitHub
First Practical and General-purpose Range Filter
☆555Mar 11, 2022Updated 4 years ago
apache / incubator-retired-quickstep
View on GitHub
Apache Quickstep Incubator - This project is retired
☆94Dec 5, 2018Updated 7 years ago
siddhi-io / siddhi
View on GitHub
Stream Processing and Complex Event Processing Engine
☆1,590May 5, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Alluxio / alluxio
View on GitHub
Alluxio, data orchestration for analytics and machine learning in the cloud
☆7,214Apr 29, 2025Updated last year
shunfei / indexr
View on GitHub
An open-source columnar data format designed for fast & realtime analytic with big data.
☆447Nov 16, 2022Updated 3 years ago
ucbrise / confluo
View on GitHub
Real-time Monitoring and Analysis of Data Streams
☆1,434May 20, 2022Updated 4 years ago
cswinter / LocustDB
View on GitHub
Blazingly fast analytics database that will rapidly devour all of your data.
☆1,648Apr 23, 2026Updated 2 months ago
TimeAndSpaceIO / SmoothieMap
View on GitHub
A gulp of low latency Java
☆307Dec 29, 2019Updated 6 years ago
tools4j / fix4j
View on GitHub
Attempt to build a fast zero garbage FIX engine with Java. (UNRELEASED)
☆15Jun 21, 2017Updated 9 years ago
apache / incubator-heron
View on GitHub
Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter
☆3,629Mar 1, 2023Updated 3 years ago