apache / datafusion-benchmarks
Apache DataFusion Benchmarks
☆17 Updated 5 months ago
Alternatives and similar repositories for datafusion-benchmarks:
Users who are interested in datafusion-benchmarks are comparing it to the repositories listed below:
- A purely experimental DuckDB Deltalake extension ☆95 Updated this week
- ☆36 Updated 2 weeks ago
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL ☆39 Updated 6 months ago
- Ibis Substrait Compiler ☆100 Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆78 Updated 6 months ago
- Redset is a dataset containing three months' worth of user query metadata that ran on a selected sample of instances in the Amazon Redshif… ☆58 Updated 6 months ago
- Apache DataFusion Ray ☆180 Updated 3 weeks ago
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB ☆32 Updated 2 years ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark… ☆106 Updated last month
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL). ☆17 Updated last month
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver. ☆24 Updated last year
- ☆244 Updated this week
- Proof-of-concept extension combining the delta extension with Unity Catalog ☆79 Updated 3 weeks ago
- Rust implementation of Apache Iceberg with integration for Datafusion ☆157 Updated this week
- TPC-H_SF10 ☆52 Updated 2 months ago
- ☆105 Updated last year
- Template for DuckDB extensions to help you develop, test and deploy a custom extension ☆180 Updated 2 weeks ago
- A native Delta implementation for integration with any query engine ☆207 Updated this week
- ☆33 Updated 2 years ago
- Arrow, pydantic style ☆82 Updated 2 years ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper) ☆238 Updated 10 months ago
- ☆15 Updated 2 years ago
- Distributed SQL Query Engine in Python using Ray ☆243 Updated 6 months ago
- Apache Arrow Flight SQL adapter for PostgreSQL ☆81 Updated last week
- DuckDB extension for Delta Lake ☆173 Updated last week
- Point-in-Time optimizations for Apache Spark ☆29 Updated last year
- A dbt adapter for Decodable ☆12 Updated last month
- Boring Data Tool ☆214 Updated last year
- Experimental support for serializing DataFusion plans using substrait ☆45 Updated 2 years ago
- ☆25 Updated last week