maxi-k / btrblocks
BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)
☆239Updated 2 weeks ago
Alternatives and similar repositories for btrblocks:
Users that are interested in btrblocks are comparing it to the libraries listed below
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆124Updated last month
- Towards a New File Format☆218Updated last month
- CMU-DB's Cascades optimizer framework☆397Updated 3 months ago
- New file format for storage of large columnar datasets.☆529Updated this week
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution.☆45Updated 11 months ago
- Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshif…☆58Updated 7 months ago
- Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretat…☆254Updated 6 years ago
- Apache Iceberg C++☆63Updated this week
- BI benchmark with user generated data and queries☆65Updated 4 months ago
- Distributed SQL Query Engine in Python using Ray☆243Updated 6 months ago
- TPC-H benchmark data generation in pure Rust☆50Updated this week
- ☆14Updated 2 weeks ago
- Apache DataFusion Ray☆183Updated 2 weeks ago
- Prototype compiler from SaneQL to SQL☆81Updated last year
- ☆529Updated 2 weeks ago
- A modular acceleration toolkit for big data analytic engines☆68Updated 11 months ago
- Pure Rust Iceberg Implementation☆163Updated 8 months ago
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17☆54Updated 11 months ago
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆234Updated this week
- OpenAurora is a cloud-native database system prototype developed at Purdue University. It is an open-source version of Amazon Aurora. It …☆90Updated last month
- A native storage format for apache arrow☆82Updated last year
- ☆84Updated last week
- 10x lower latency for cloud-native DataFusion☆96Updated this week
- ☆40Updated last week
- Rust implementation of Apache Iceberg with integration for Datafusion☆163Updated last week
- Order-preserving key encoder☆122Updated 4 years ago
- Apache Parquet Testing☆57Updated this week
- This is the companion repository for the book How Query Engines Work.☆386Updated last year
- LingoDB: A new analytical database system that blurs the lines between databases and compilers.☆244Updated this week
- Ibis Substrait Compiler☆102Updated this week