jorgecarleitao / datafusion-python
A Python library to run analytics workloads with the performance of Rust, the flexibility of Python and O(1) cost in moving data between the two. Uses Apache Arrow in-memory format and respective query engine DataFusion.
☆60Updated 3 years ago
Alternatives and similar repositories for datafusion-python:
Users that are interested in datafusion-python are comparing it to the libraries listed below
- Experimental support for serializing DataFusion plans using substrait☆45Updated 2 years ago
- Arrow, pydantic style☆84Updated 2 years ago
- Python binding for DataFusion☆59Updated 2 years ago
- ☆53Updated 9 months ago
- An opinionated and batteries included DataFusion implementation.☆133Updated 2 weeks ago
- Fill Apache Arrow record batches from an ODBC data source in Rust.☆62Updated this week
- A purely experimental DuckDB Deltalake extension☆95Updated this week
- Query Plan Markup Language☆45Updated last year
- Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow☆358Updated 6 months ago
- DataFusion TableProviders for reading data from other systems☆81Updated this week
- Rust crate for Substrait: Cross-Language Serialization for Relational Algebra☆63Updated this week
- JSON support for DataFusion (unofficial)☆34Updated last week
- Derive for arrow2☆65Updated last year
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆96Updated this week
- Generated Rust of Apache Arrow spec☆16Updated last year
- Rust implementation of the FastLanes compression library☆94Updated last month
- Ibis Substrait Compiler☆98Updated this week
- µWheel DataFusion Optimizer for speeding up time-based analytics☆30Updated last month
- ☆34Updated last week
- Optimizer for DataFusion based on the egg framework☆13Updated 2 years ago
- Rust implementation of Apache Iceberg with integration for Datafusion☆145Updated this week
- Apache Arrow Ballista Python bindings☆37Updated last year
- Apache Arrow Flight SQL adapter for PostgreSQL☆75Updated last month
- A native Delta implementation for integration with any query engine☆188Updated this week
- Boring Data Tool☆213Updated 10 months ago
- ☆21Updated 9 months ago
- HDFS based on Java implementation as a remote ObjectStore for DataFusion☆10Updated last year
- Serverless query engine☆140Updated 2 years ago
- Apache DataFusion Benchmarks☆16Updated 3 months ago
- Convert sequences of Rust objects to Arrow tables☆74Updated 2 weeks ago