amazon-science / redset
Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshift fleet. We provide query metadata for 200 provisioned and serverless instances each.
☆58Updated 6 months ago
Alternatives and similar repositories for redset:
Users that are interested in redset are comparing it to the libraries listed below
- BI benchmark with user generated data and queries☆64Updated 3 months ago
- ☆37Updated last week
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆238Updated 10 months ago
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆121Updated last week
- Apache Iceberg C++☆51Updated this week
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution.☆45Updated 10 months ago
- Towards a New File Format☆210Updated 3 weeks ago
- Apache DataFusion Benchmarks☆17Updated 4 months ago
- A benchmark for serverless analytic databases.☆20Updated 6 months ago
- ☆26Updated 5 years ago
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17☆51Updated 10 months ago
- tpch-dbgen☆38Updated 12 years ago
- Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretat…☆254Updated 6 years ago
- Snowflake dataset containing statistics for 70 million queries over 14 day period☆111Updated 3 years ago
- ⚡ Faster vector search with PDX: A vertical data layout for vectors☆27Updated 2 weeks ago
- Ibis Substrait Compiler☆100Updated this week
- A Database System for Research and Fast Prototyping☆102Updated 2 months ago
- Reference implementations for the LDBC Social Network Benchmark's Business Intelligence (BI) workload☆40Updated last month
- Lakehouse storage system benchmark☆72Updated 2 years ago
- Pollock is a benchmark for data loading on character-delimited files.☆10Updated last year
- ☆79Updated 2 years ago
- A modular acceleration toolkit for big data analytic engines☆68Updated 10 months ago
- Reproducing TPC-DS qualification/reference results☆32Updated last year
- An efficient storage and compute engine for both on-prem and cloud-native data analytics.☆143Updated last week
- Code for our paper "Evaluating SIMD Compiler-Intrinsics for Database Systems"☆14Updated last year
- Balsa is a learned SQL query optimizer. It tailor optimizes your SQL queries to find the best execution plans for your hardware and engin…☆138Updated 2 years ago
- Auto-Steer☆45Updated 3 months ago
- ☆82Updated this week
- simd enabled column imprints☆11Updated 7 years ago
- ☆30Updated 2 years ago