embryo-labs / EvaluationOfColumnarFormats
Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17
☆62 · Updated last year
Alternatives and similar repositories for EvaluationOfColumnarFormats
Users who are interested in EvaluationOfColumnarFormats are comparing it to the repositories listed below.
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics! ☆136 · Updated 3 weeks ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper) ☆251 · Updated 4 months ago
- Transactional functions-as-a-service for database-oriented applications. ☆154 · Updated last year
- OpenAurora is a cloud-native database system prototype developed at Purdue University. It is an open-source version of Amazon Aurora. It … ☆96 · Updated 3 weeks ago
- Query Optimizer Service ☆80 · Updated 2 months ago
- ☆64 · Updated 2 years ago
- Next-Gen Big Data File Format ☆440 · Updated 3 weeks ago
- CMU-DB's Cascades optimizer framework ☆403 · Updated 7 months ago
- This is the source code for our (Tobias Ziegler, Carsten Binnig and Viktor Leis) published paper at SIGMOD’22: ScaleStore: A Fast and Cos… ☆124 · Updated 10 months ago
- SIGMOD Contest 2025 Winning Solution ☆34 · Updated 2 months ago
- ☆34 · Updated 3 years ago
- Distributed pushdown cache for DataFusion ☆239 · Updated last week
- Distributed SQL Query Engine in Python using Ray ☆244 · Updated 10 months ago
- PRISM is a UDF optimization framework that deconstructs a UDF into separate inlinable and outlinable pieces, resulting in simpler queries… ☆17 · Updated last week
- ☆141 · Updated 3 years ago
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution. ☆52 · Updated last year
- A Database System for Research and Fast Prototyping ☆105 · Updated 2 months ago
- ☆72 · Updated 5 months ago
- TPC-H benchmark data generation in pure Rust ☆131 · Updated this week
- Prototype compiler from SaneQL to SQL ☆84 · Updated last year
- Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshif… ☆62 · Updated 11 months ago
- New file format for storage of large columnar datasets. ☆586 · Updated 2 weeks ago
- A runtime implementation of data-parallel actors. ☆38 · Updated 3 years ago
- A modular acceleration toolkit for big data analytic engines ☆67 · Updated last year
- Compaction runtime for Apache Iceberg. ☆67 · Updated this week
- Reproducibility package for "Two Birds With One Stone: Designing a Hybrid Cloud Storage Engine for HTAP" ☆22 · Updated last year
- A SQL query compiler written in Rust from scratch ☆22 · Updated 11 months ago
- Apache Parquet Testing ☆68 · Updated last week
- Tools for generating TPC-* datasets ☆30 · Updated last year
- Lakehouse storage system benchmark ☆75 · Updated 2 years ago