amazon-science / redset
Redset is a dataset containing three months' worth of metadata for user queries that ran on a selected sample of instances in the Amazon Redshift fleet. We provide query metadata for 200 provisioned instances and 200 serverless instances.
☆45 · Updated 2 months ago
Related projects
Alternatives and complementary repositories for redset
- BI benchmark with user generated data and queries ☆64 · Updated 5 years ago
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics! ☆104 · Updated last month
- A benchmark for serverless analytic databases. ☆19 · Updated 2 months ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper) ☆228 · Updated 6 months ago
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution. ☆42 · Updated 6 months ago
- An efficient storage and compute engine for both on-prem and cloud-native data analytics. ☆138 · Updated 2 weeks ago
- tpch-dbgen ☆34 · Updated 12 years ago
- Reference implementations for the LDBC Social Network Benchmark's Business Intelligence (BI) workload ☆36 · Updated 2 months ago
- Snowflake dataset containing statistics for 70 million queries over a 14-day period ☆103 · Updated 3 years ago
- Model implementation and explorative UI for the paper "Towards Cost-Optimal Query Processing in the Cloud". Slides: https://bit.ly/37ZfeP… ☆14 · Updated 11 months ago
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17 ☆45 · Updated 6 months ago
- A Relational Optimizer and Executor ☆66 · Updated 3 years ago
- SIMD-enabled column imprints ☆11 · Updated 6 years ago
- Query engine synthesizer based on our domain-specific language, VOILA ☆12 · Updated 3 years ago
- Lakehouse storage system benchmark ☆66 · Updated last year
- Ibis Substrait Compiler ☆95 · Updated this week
- Towards a New File Format ☆162 · Updated 2 months ago
- A Database System for Research and Fast Prototyping ☆97 · Updated 3 weeks ago
- Auto-Steer ☆39 · Updated 3 weeks ago
- Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretat… ☆253 · Updated 6 years ago
- Pollock is a benchmark for data loading on character-delimited files. ☆9 · Updated last year
- OpenAurora is a cloud-native database system prototype developed at Purdue University. It is an open-source version of Amazon Aurora. It … ☆72 · Updated this week
- Template for DuckDB extensions to help you develop, test and deploy a custom extension ☆148 · Updated last week
- Reproducing TPC-DS qualification/reference results ☆31 · Updated last year
- Reducing the cache misses of SIMD vectorization using IMV ☆27 · Updated 2 years ago