amazon-science / redsetLinks
Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshift fleet. We provide query metadata for 200 provisioned and serverless instances each.
☆63Updated last year
Alternatives and similar repositories for redset
Users that are interested in redset are comparing it to the libraries listed below
Sorting:
- BI benchmark with user generated data and queries☆71Updated 11 months ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆270Updated 7 months ago
- ☆53Updated 5 months ago
- Ibis Substrait Compiler☆105Updated this week
- TPC-H benchmark data generation in pure Rust☆212Updated last week
- Apache DataFusion Benchmarks☆22Updated this week
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆142Updated 2 months ago
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution.☆55Updated last year
- Next-Gen Big Data File Format☆522Updated last month
- Reference implementations for the LDBC Social Network Benchmark's Business Intelligence (BI) workload☆44Updated 8 months ago
- [VLDB 2023 Vol 17] "An Empirical Evaluation of Columnar Storage Formats"☆67Updated last month
- Distributed SQL Query Engine in Python using Ray☆246Updated last year
- TPC-H dbgen☆315Updated 2 years ago
- New file format for storage of large columnar datasets.☆647Updated last week
- ☆80Updated 3 years ago
- tpch-dbgen☆38Updated 13 years ago
- A benchmark for serverless analytic databases.☆23Updated last year
- Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretat…☆264Updated 7 years ago
- ☆344Updated this week
- Apache DataFusion Ray☆223Updated last month
- Apache Iceberg C++☆158Updated this week
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆87Updated last month
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆254Updated last week
- Snowflake dataset containing statistics for 70 million queries over 14 day period☆115Updated 4 years ago
- Distributed pushdown cache for DataFusion☆340Updated this week
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆244Updated last week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆276Updated last year
- ☆10Updated last year
- A modular acceleration toolkit for big data analytic engines☆67Updated last year
- Reproducing TPC-DS qualification/reference results☆34Updated 2 years ago