ljishen / tpch-data
Generate TPC-H data in Parquet format
☆15 Updated 3 years ago
Alternatives and similar repositories for tpch-data
Users that are interested in tpch-data are comparing it to the libraries listed below
- Ibis Substrait Compiler ☆109 Updated last week
- Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshif… ☆68 Updated last year
- Distributed SQL Query Engine in Python using Ray ☆245 Updated last year
- Apache DataFusion Ray ☆229 Updated 4 months ago
- Apache DataFusion Python Bindings ☆558 Updated this week
- Apache DataFusion Benchmarks ☆24 Updated last month
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you… ☆267 Updated last week
- ☆11 Updated last month
- ☆56 Updated last month
- TPC-H_SF10 ☆53 Updated last year
- ☆376 Updated this week
- Distributed SQL Engine in Python using Dask ☆409 Updated last year
- Database connectivity API standard and libraries for Apache Arrow ☆545 Updated this week
- ☆80 Updated 3 years ago
- A Python-to-SQL transpiler as replacement for Python Pandas ☆48 Updated 3 years ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper) ☆278 Updated 10 months ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans. ☆1,464 Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆279 Updated last year
- Implements the [TPCH benchmark](http://www.tpc.org/tpch/) for Postgres ☆30 Updated 3 years ago
- New file format for storage of large columnar datasets. ☆690 Updated this week
- This is the companion repository for the book How Query Engines Work. ☆422 Updated 2 weeks ago
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De… ☆88 Updated 3 months ago
- TPC-H benchmark data generation in pure Rust ☆226 Updated last week
- A native Delta implementation for integration with any query engine ☆312 Updated this week
- A benchmark for serverless analytic databases. ☆25 Updated 2 weeks ago
- Query Optimizer Service ☆95 Updated 8 months ago
- Documentation for Hyper, the blazingly fast SQL engine powering analytics at Tableau and Salesforce ☆32 Updated 2 weeks ago
- Distributed pushdown cache for DataFusion ☆377 Updated 2 weeks ago
- A purely experimental DuckDB Deltalake extension ☆95 Updated last week
- Fully Managed, Streaming Ingestion (CDC) into your Lakehouse ☆301 Updated last week