embryo-labs / EvaluationOfColumnarFormatsLinks
Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17
☆60Updated last year
Alternatives and similar repositories for EvaluationOfColumnarFormats
Users that are interested in EvaluationOfColumnarFormats are comparing it to the libraries listed below
Sorting:
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆131Updated last month
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆251Updated 3 months ago
- Transactional functions-as-a-service for database-oriented applications.☆152Updated last year
- ☆64Updated 2 years ago
- This is the source code for our (Tobias Ziegler, Carsten Binnig and Viktor Leis) published paper at SIGMOD’22: ScaleStore: A Fast and Cos…☆124Updated 9 months ago
- ☆33Updated 3 years ago
- OpenAurora is a cloud-native database system prototype developed at Purdue University. It is an open-source version of Amazon Aurora. It …☆94Updated last month
- Reproducibility package for "Two Birds With One Stone: Designing a Hybrid Cloud Storage Engine for HTAP"☆22Updated last year
- Query Optimizer Service☆75Updated 2 months ago
- SIGMOD Contest 2025 Winning Solution☆31Updated last month
- A Database System for Research and Fast Prototyping☆106Updated 2 months ago
- Strategically Deconstruct UDFs with PRISM for Faster Query Plans☆17Updated 8 months ago
- Implementation and artifacts for "User-Defined Operators: Efficiently Integrating Custom Algorithms into Modern Databases"☆24Updated last year
- Tools for generating TPC-* datasets☆29Updated last year
- ☆138Updated 3 years ago
- A runtime implementation of data-parallel actors.☆38Updated 3 years ago
- ☆72Updated 4 months ago
- CMU-DB's Cascades optimizer framework☆402Updated 6 months ago
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution.☆50Updated last year
- In-memory, columnar, arrow-based database.☆48Updated 2 years ago
- Next-Gen Big Data File Format☆394Updated this week
- Low-Latency Transaction Scheduling via Userspace Interrupts: Why Wait or Yield When You Can Preempt? (SIGMOD 2025 Best Paper Award)☆63Updated 3 months ago
- Distributed pushdown cache for DataFusion☆216Updated last week
- Lakehouse storage system benchmark☆76Updated 2 years ago
- a high performance cache simulator and library☆106Updated 11 months ago
- Prototype compiler from SaneQL to SQL☆83Updated last year
- Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshif…☆63Updated 10 months ago
- ☆10Updated last year
- Distributed SQL Query Engine in Python using Ray☆244Updated 9 months ago
- TPC-H benchmark data generation in pure Rust☆118Updated this week