ljishen / tpch-dataLinks
Generate tpch data in parquet format
☆14Updated 2 years ago
Alternatives and similar repositories for tpch-data
Users that are interested in tpch-data are comparing it to the libraries listed below
Sorting:
- Ibis Substrait Compiler☆103Updated this week
- Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshif…☆62Updated 10 months ago
- Apache DataFusion Python Bindings☆464Updated last week
- Apache DataFusion Benchmarks☆20Updated 3 months ago
- Distributed SQL Query Engine in Python using Ray☆243Updated 9 months ago
- Distributed SQL Engine in Python using Dask☆406Updated 10 months ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆230Updated this week
- Apache DataFusion Ray☆207Updated 3 months ago
- ☆291Updated this week
- Database connectivity API standard and libraries for Apache Arrow☆466Updated this week
- A Python-to-SQL transpiler as replacement for Python Pandas☆48Updated 2 years ago
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆259Updated 9 months ago
- ☆45Updated 2 weeks ago
- A native Delta implementation for integration with any query engine☆236Updated this week
- TPC-H_SF10☆53Updated 5 months ago
- New file format for storage of large columnar datasets.☆567Updated this week
- ☆79Updated 2 years ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆247Updated 3 months ago
- TPC-H dbgen☆300Updated last year
- A purely experimental DuckDB Deltalake extension☆95Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,355Updated 3 weeks ago
- Pythonic Iceberg REST Catalog☆2Updated 3 weeks ago
- A Singer.io target for DuckDB☆18Updated 4 months ago
- tpch-dbgen☆38Updated 13 years ago
- Distributed pushdown cache for DataFusion☆189Updated this week
- Catalog, compose, and ship ML—Python simplicity, SQL scale.☆305Updated this week
- DuckDB extension for Delta Lake☆193Updated this week
- Python bindings for sqlparser-rs☆191Updated last month
- Spark RAPIDS Benchmarks – benchmark sets and utilities for the RAPIDS Accelerator for Apache Spark☆41Updated 2 months ago
- Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretat…☆258Updated 6 years ago