stanford-futuredata / sparser
Sparser: Raw Filtering for Faster Analytics over Raw Data
☆432, updated 6 years ago
Alternatives and similar repositories for sparser:
Users interested in sparser are comparing it to the libraries listed below.
- Vectorized processing for Apache Arrow ☆484, updated 2 years ago
- A Relational Database Backed by Apache Kafka ☆390, updated last week
- BlinkDB: Sub-Second Approximate Queries on Very Large Data. ☆660, updated 10 years ago
- Waltz is a quorum-based distributed write-ahead log for replicating transactions ☆415, updated last year
- ☆448, updated 2 years ago
- MacroBase: A Search Engine for Fast Data ☆664, updated 2 years ago
- RocksDB Replication ☆667, updated 7 months ago
- Enabling queries on compressed data. ☆278, updated last year
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in… ☆1,038, updated 2 years ago
- A Scalable Concurrent Key-Value Map for Big Data Analytics ☆269, updated last year
- Distributed storage for sequential data ☆1,899, updated 3 years ago
- This is the official mirror of the MonetDB Mercurial repository. Please note that we do not accept pull requests on github. The regressio… ☆311, updated 4 years ago
- Cache File System optimized for columnar formats and object stores ☆182, updated 2 years ago
- NoSQL data store using the SEASTAR framework, compatible with Redis ☆1,315, updated 5 years ago
- A data-driven compute platform ☆1,214, updated 5 years ago
- Trinity IR Infrastructure ☆237, updated 5 years ago
- A tool to mount HDFS as a local Linux file system ☆288, updated 4 years ago
- The Accelerator is a tool for fast and reproducible processing of large amounts of data. ☆150, updated 2 years ago
- Parsing and analysis of Vertica, Hive, and Presto SQL. ☆1,078, updated 2 years ago
- Mirror of Apache Cassandra (incubating) ☆443, updated last year
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads. ☆424, updated 3 years ago
- Mirror of Apache Apex core ☆349, updated 3 years ago
- FishStore is a prototype fast ingestion and querying layer for flexible-schema data ☆217, updated last year
- Low latency, strong consistency, fault tolerant distributed key value store. Colocate data and compute to achieve best performance cloud … ☆113, updated 9 years ago
- A record-oriented store built on FoundationDB ☆598, updated this week
- First Practical and General-purpose Range Filter ☆536, updated 2 years ago
- Hops Hadoop is a distribution of Apache Hadoop with distributed metadata. ☆309, updated 3 weeks ago
- Bistro is a flexible distributed scheduler, a high-performance framework supporting multiple paradigms while retaining ease of configurat… ☆1,033, updated last year
- Interactive-Speed Analytics: 200x Faster, 200x Fewer Cluster Resources, Approximate Query Processing ☆248, updated 4 years ago
- A fast key/value store that is efficient for high-volume random access reads and writes. ☆355, updated 7 years ago