amazon-science / redsetLinks
Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshift fleet. We provide query metadata for 200 provisioned and serverless instances each.
☆65Updated last year
Alternatives and similar repositories for redset
Users that are interested in redset are comparing it to the libraries listed below
Sorting:
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆277Updated 9 months ago
- BI benchmark with user generated data and queries☆72Updated last year
- ☆53Updated last month
- Apache DataFusion Benchmarks☆23Updated 2 weeks ago
- Ibis Substrait Compiler☆108Updated last week
- Next-Gen Big Data File Format☆640Updated 3 months ago
- Reference implementations for the LDBC Social Network Benchmark's Business Intelligence (BI) workload☆44Updated 3 weeks ago
- TPC-H benchmark data generation in pure Rust☆221Updated this week
- [VLDB 2023 Vol 17] "An Empirical Evaluation of Columnar Storage Formats"☆69Updated 3 months ago
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆144Updated 2 weeks ago
- A benchmark for serverless analytic databases.☆25Updated last year
- tpch-dbgen☆38Updated 13 years ago
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution.☆55Updated last year
- TPC-H dbgen☆323Updated 2 years ago
- Reproducing TPC-DS qualification/reference results☆34Updated 2 years ago
- ☆80Updated 3 years ago
- Lakehouse storage system benchmark☆77Updated 2 years ago
- New file format for storage of large columnar datasets.☆674Updated this week
- Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretat…☆266Updated 7 years ago
- ☆365Updated last week
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆256Updated 3 weeks ago
- Distributed SQL Query Engine in Python using Ray☆246Updated last year
- Apache Iceberg C++☆182Updated this week
- PRISM is a UDF optimization framework that deconstructs a UDF into separate inlinable and outlinable pieces, resulting in simpler queries…☆18Updated 5 months ago
- Core C++ Sketch Library☆253Updated this week
- ☆11Updated last year
- [SIGMOD 2026] F3: The Open-Source Data File Format for the Future☆374Updated 2 months ago
- CMU-DB's Cascades optimizer framework☆403Updated last year
- TPC-H_SF10☆53Updated 11 months ago
- Snowflake dataset containing statistics for 70 million queries over 14 day period☆115Updated 4 years ago