amazon-science / redsetLinks
Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshift fleet. We provide query metadata for 200 provisioned and serverless instances each.
☆61Updated 9 months ago
Alternatives and similar repositories for redset
Users that are interested in redset are comparing it to the libraries listed below
Sorting:
- BI benchmark with user generated data and queries☆66Updated 6 months ago
- Apache DataFusion Benchmarks☆19Updated 2 months ago
- Snowflake dataset containing statistics for 70 million queries over 14 day period☆112Updated 3 years ago
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution.☆48Updated last year
- ☆42Updated last month
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆129Updated 2 weeks ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆245Updated 2 months ago
- Reference implementations for the LDBC Social Network Benchmark's Business Intelligence (BI) workload☆43Updated 2 months ago
- Reproducing TPC-DS qualification/reference results☆32Updated last year
- Ibis Substrait Compiler☆103Updated this week
- Prototype compiler from SaneQL to SQL☆82Updated last year
- ☆79Updated 2 years ago
- A benchmark for serverless analytic databases.☆22Updated 9 months ago
- Pollock is a benchmark for data loading on character-delimited files.☆18Updated 2 months ago
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17☆57Updated last year
- Model implementation and explorative UI for the paper "Towards Cost-Optimal Query Processing in the Cloud". Slides: https://bit.ly/37ZfeP…☆15Updated last year
- Implementation and artifacts for "User-Defined Operators: Efficiently Integrating Custom Algorithms into Modern Databases"☆24Updated last year
- Lakehouse storage system benchmark☆75Updated 2 years ago
- Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretat…☆257Updated 6 years ago
- A Database System for Research and Fast Prototyping☆104Updated 3 weeks ago
- Quickstep project☆42Updated 7 months ago
- ☆29Updated 5 years ago
- tpch-dbgen☆38Updated 12 years ago
- Reducing the cache misses of SIMD vectorization using IMV☆28Updated 2 years ago
- Apache Iceberg C++☆87Updated this week
- Next-Gen Big Data File Format☆233Updated this week
- DuckDB is an in-process SQL OLAP Database Management System☆44Updated 3 weeks ago
- ☆26Updated this week
- Query engine synthesizer based on, our domain-specific language, VOILA☆13Updated 4 years ago
- Reproducibility package for "Robust Join Processing with Diamond Hardened Joins"☆12Updated 11 months ago