amazon-science / redset
Redset is a dataset containing three months' worth of metadata on user queries that ran on a selected sample of instances in the Amazon Redshift fleet. We provide query metadata for 200 provisioned and 200 serverless instances.
☆62 Updated 10 months ago
Alternatives and similar repositories for redset
Users who are interested in redset are comparing it to the libraries listed below.
- ☆45 Updated 2 weeks ago
- BI benchmark with user generated data and queries ☆67 Updated 6 months ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper) ☆247 Updated 3 months ago
- Apache DataFusion Benchmarks ☆20 Updated 3 months ago
- Ibis Substrait Compiler ☆103 Updated this week
- Next-Gen Big Data File Format ☆245 Updated this week
- ☆79 Updated 2 years ago
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17 ☆59 Updated last year
- New file format for storage of large columnar datasets. ☆567 Updated this week
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics! ☆130 Updated 2 weeks ago
- Template for DuckDB extensions to help you develop, test and deploy a custom extension ☆206 Updated 3 weeks ago
- TPC-H_SF10 ☆53 Updated 5 months ago
- ☆28 Updated this week
- Apache Arrow Flight SQL adapter for PostgreSQL ☆90 Updated 3 months ago
- In-Memory Analytics with Apache Arrow, published by Packt ☆101 Updated last year
- Pollock is a benchmark for data loading on character-delimited files. ☆20 Updated 3 months ago
- Apache Arrow Cookbook ☆104 Updated 2 months ago
- Distributed SQL Query Engine in Python using Ray ☆243 Updated 9 months ago
- Reference implementations for the LDBC Social Network Benchmark's Business Intelligence (BI) workload ☆43 Updated 3 months ago
- DuckDB is an in-process SQL OLAP Database Management System ☆44 Updated 3 weeks ago
- ☆291 Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆230 Updated this week
- A purely experimental DuckDB Deltalake extension ☆95 Updated this week
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution. ☆49 Updated last year
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆259 Updated 9 months ago
- Apache Iceberg C++ ☆96 Updated this week
- Reproducing TPC-DS qualification/reference results ☆32 Updated last year
- TPC-H dbgen ☆300 Updated last year
- tpch-dbgen ☆38 Updated 13 years ago
- DuckDB-powered analytics in Postgres ☆152 Updated last year