jorgecarleitao / datafusion-python
A Python library to run analytics workloads with the performance of Rust, the flexibility of Python and O(1) cost in moving data between the two. Uses Apache Arrow in-memory format and respective query engine DataFusion.
☆61Updated 3 years ago
Alternatives and similar repositories for datafusion-python:
Users that are interested in datafusion-python are comparing it to the libraries listed below
- Experimental support for serializing DataFusion plans using substrait☆45Updated 2 years ago
- Arrow, pydantic style☆82Updated 2 years ago
- Python binding for DataFusion☆59Updated 2 years ago
- ☆55Updated 11 months ago
- Fill Apache Arrow record batches from an ODBC data source in Rust.☆66Updated this week
- Rust crate for Substrait: Cross-Language Serialization for Relational Algebra☆65Updated this week
- Query Plan Markup Language☆45Updated last year
- Generated Rust of Apache Arrow spec☆17Updated last year
- S3 as an ObjectStore for DataFusion☆61Updated 2 years ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆144Updated this week
- ☆38Updated this week
- Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow☆359Updated 8 months ago
- DataFusion TableProviders for reading data from other systems☆102Updated this week
- Rust implementation of Apache Iceberg with integration for Datafusion☆162Updated this week
- A purely experimental DuckDB Deltalake extension☆95Updated this week
- JSON support for DataFusion (unofficial)☆38Updated 3 weeks ago
- Rust implementation of the FastLanes compression library☆98Updated 2 weeks ago
- Optimizer for DataFusion based on the egg framework☆13Updated 3 years ago
- Incremental view maintenance & query rewriting for materialized views in DataFusion☆29Updated last week
- Derive for arrow2☆66Updated last year
- An experimental (work-in-progress) statically typed implementation of Apache Arrow☆18Updated last week
- Robust data transformation tool using SQL☆21Updated 2 years ago
- Apache Arrow PostgreSQL connector☆59Updated last year
- HDFS based on Java implementation as a remote ObjectStore for DataFusion☆10Updated last year
- Apache Arrow Ballista Python bindings☆37Updated last year
- Ibis Substrait Compiler☆102Updated this week
- Connecting DataFusion to HDFS based on libhdfs3☆13Updated 3 years ago
- Apache Arrow Flight SQL adapter for PostgreSQL☆82Updated 3 weeks ago
- WASM bindings for DataFusion☆20Updated this week
- results cache for Apache DataFusion☆22Updated 5 months ago