stanford-futuredata / sparser
Sparser: Raw Filtering for Faster Analytics over Raw Data
☆434 Updated 7 years ago
Alternatives and similar repositories for sparser
Users who are interested in sparser are comparing it to the libraries listed below.
- Vectorized processing for Apache Arrow ☆485 Updated 3 years ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data. ☆659 Updated 11 years ago
- Enabling queries on compressed data. ☆282 Updated 2 years ago
- MacroBase: A Search Engine for Fast Data ☆671 Updated 3 years ago
- A Relational Database Backed by Apache Kafka ☆389 Updated 3 months ago
- This is the official mirror of the MonetDB Mercurial repository. Please note that we do not accept pull requests on github. The regressio… ☆309 Updated 5 years ago
- The Accelerator is a tool for fast and reproducible processing of large amounts of data. ☆149 Updated 3 years ago
- Waltz is a quorum-based distributed write-ahead log for replicating transactions ☆425 Updated 2 years ago
- Hops Hadoop is a distribution of Apache Hadoop with distributed metadata. ☆320 Updated 8 months ago
- Interactive-Speed Analytics: 200x Faster, 200x Fewer Cluster Resources, Approximate Query Processing ☆252 Updated 5 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in… ☆1,035 Updated 3 years ago
- Cache File System optimized for columnar formats and object stores ☆187 Updated 3 years ago
- A data-driven compute platform ☆1,215 Updated 6 years ago
- RocksDB Replication ☆680 Updated last year
- The SpliceSQL Engine ☆171 Updated 2 years ago
- Mirror of Apache Cassandra (incubating) ☆438 Updated 2 years ago
- ☆448 Updated 3 years ago
- A tool to mount HDFS as a local Linux file system ☆289 Updated 5 years ago
- Berkeley Tree Database (BTrDB) server ☆911 Updated 4 years ago
- Distributed storage for sequential data ☆1,905 Updated 4 years ago
- Apache Quickstep Incubator - This project is retired ☆94 Updated 7 years ago
- Low latency, strong consistency, fault tolerant distributed key value store. Colocate data and compute to achieve best performance cloud … ☆115 Updated 10 years ago
- Self regulation and auto-tuning for distributed system ☆67 Updated 2 years ago
- Quark is a data virtualization engine over analytic databases. ☆100 Updated 8 years ago
- Real²time Exploratory Analytics on Large Datasets ☆121 Updated 5 years ago
- DBSeer ☆115 Updated 6 years ago
- A collection of libraries for single-pass, distributed, sublinear-space approximate aggregation and sketching algorithms. Currently: Hype… ☆164 Updated 7 months ago
- Quantcast File System ☆649 Updated 2 weeks ago
- A software library of stochastic streaming algorithms, a.k.a. sketches. ☆942 Updated last week
- Apache Fluo ☆194 Updated 3 months ago