amazon-science / redset
Redset is a dataset containing three months' worth of user query metadata from queries that ran on a selected sample of instances in the Amazon Redshift fleet. We provide query metadata for 200 provisioned and 200 serverless instances.
☆58 Updated 7 months ago
Alternatives and similar repositories for redset:
Users who are interested in redset are comparing it to the libraries listed below.
- BI benchmark with user generated data and queries ☆65 Updated 4 months ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper) ☆239 Updated 2 weeks ago
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics! ☆124 Updated last month
- Apache DataFusion Benchmarks ☆18 Updated 2 weeks ago
- A benchmark for serverless analytic databases. ☆20 Updated 7 months ago
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution. ☆45 Updated 11 months ago
- Snowflake dataset containing statistics for 70 million queries over a 14-day period ☆111 Updated 3 years ago
- Apache Iceberg C++ ☆63 Updated this week
- Reference implementations for the LDBC Social Network Benchmark's Business Intelligence (BI) workload ☆42 Updated 3 weeks ago
- Reproducing TPC-DS qualification/reference results ☆32 Updated last year
- Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretat… ☆254 Updated 6 years ago
- Towards a New File Format ☆218 Updated last month
- ☆40 Updated last week
- A Database System for Research and Fast Prototyping ☆102 Updated 3 weeks ago
- Ibis Substrait Compiler ☆102 Updated this week
- Reducing the cache misses of SIMD vectorization using IMV ☆28 Updated 2 years ago
- Pollock is a benchmark for data loading on character-delimited files. ☆11 Updated last week
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17 ☆54 Updated 11 months ago
- ☆79 Updated 2 years ago
- tpch-dbgen ☆38 Updated 12 years ago
- Implementation and artifacts for "User-Defined Operators: Efficiently Integrating Custom Algorithms into Modern Databases" ☆23 Updated last year
- [SIGMOD'25] Source code for the paper: Debunking the Myth of Join Ordering: Toward Robust SQL Analytics ☆12 Updated last month
- Lakehouse storage system benchmark ☆73 Updated 2 years ago
- SQL-ProcBench is an open benchmark for procedural workloads in RDBMSs. ☆46 Updated 3 years ago
- ☆28 Updated 5 years ago
- A Relational Optimizer and Executor ☆66 Updated 4 months ago
- ☆71 Updated 2 years ago
- ☆72 Updated 3 weeks ago
- ☆24 Updated 3 years ago
- A modular acceleration toolkit for big data analytic engines ☆68 Updated 11 months ago