amazon-science / redset
Redset is a dataset containing three months' worth of metadata for user queries that ran on a selected sample of instances in the Amazon Redshift fleet. We provide query metadata for 200 provisioned and 200 serverless instances.
☆62 Updated last year
Alternatives and similar repositories for redset
Users who are interested in redset are comparing it to the repositories listed below:
- BI benchmark with user generated data and queries ☆71 Updated 8 months ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper) ☆255 Updated 5 months ago
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics! ☆137 Updated last week
- Apache DataFusion Benchmarks ☆21 Updated 5 months ago
- ☆48 Updated 2 months ago
- A benchmark for serverless analytic databases. ☆23 Updated 11 months ago
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution. ☆52 Updated last year
- Next-Gen Big Data File Format ☆477 Updated this week
- Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretat… ☆261 Updated 7 years ago
- Ibis Substrait Compiler ☆105 Updated this week
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17 ☆62 Updated last year
- Snowflake dataset containing statistics for 70 million queries over 14 day period ☆113 Updated 3 years ago
- New file format for storage of large columnar datasets. ☆607 Updated this week
- TPC-H benchmark data generation in pure Rust ☆165 Updated last week
- ☆10 Updated last year
- ☆80 Updated 2 years ago
- Reproducing TPC-DS qualification/reference results ☆32 Updated 2 years ago
- Lakehouse storage system benchmark ☆76 Updated 2 years ago
- A Relational Optimizer and Executor ☆66 Updated 8 months ago
- Apache Iceberg C++ ☆110 Updated last week
- Multi-DBMS SQL Benchmarking Framework via JDBC ☆585 Updated 4 months ago
- TPC-H dbgen ☆314 Updated 2 years ago
- Reference implementations for the LDBC Social Network Benchmark's Business Intelligence (BI) workload ☆44 Updated 5 months ago
- PRISM is a UDF optimization framework that deconstructs a UDF into separate inlinable and outlinable pieces, resulting in simpler queries… ☆17 Updated last month
- Core C++ Sketch Library ☆238 Updated 3 weeks ago
- OpenAurora is a cloud-native database system prototype developed at Purdue University. It is an open-source version of Amazon Aurora. It … ☆98 Updated last month
- Reproducibility package for "Robust Join Processing with Diamond Hardened Joins" ☆12 Updated last year
- Model implementation and explorative UI for the paper "Towards Cost-Optimal Query Processing in the Cloud". Slides: https://bit.ly/37ZfeP… ☆15 Updated last year
- Distributed pushdown cache for DataFusion ☆261 Updated this week
- tpch-dbgen ☆38 Updated 13 years ago