jorgecarleitao / datafusion-python
A Python library to run analytics workloads with the performance of Rust, the flexibility of Python, and O(1) cost in moving data between the two. It uses the Apache Arrow in-memory format and its query engine, DataFusion.
☆60 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for datafusion-python
- Experimental support for serializing DataFusion plans using Substrait ☆44 · Updated last year
- Python binding for DataFusion ☆59 · Updated 2 years ago
- Arrow, pydantic style ☆82 · Updated last year
- Fill Apache Arrow record batches from an ODBC data source in Rust. ☆50 · Updated this week
- JSON support for DataFusion (unofficial) ☆28 · Updated last week
- ☆53 · Updated 6 months ago
- A batteries included data processing and DataFusion development app for the terminal ☆111 · Updated this week
- A purely experimental DuckDB Deltalake extension ☆94 · Updated this week
- S3 as an ObjectStore for DataFusion ☆59 · Updated last year
- Apache Arrow Ballista Python bindings ☆33 · Updated 8 months ago
- Apache Arrow Flight SQL adapter for PostgreSQL ☆68 · Updated last month
- A DataFusion-powered Serverless S3 Proxy. ☆14 · Updated 6 months ago
- DataFusion TableProviders for reading data from other systems ☆59 · Updated this week
- Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow ☆355 · Updated 3 months ago
- Rust crate for Substrait: Cross-Language Serialization for Relational Algebra ☆58 · Updated this week
- Rust implementation of Apache Iceberg with integration for DataFusion ☆99 · Updated this week
- Rust lib to read from Apache ORC ☆18 · Updated last year
- Robust data transformation tool using SQL ☆20 · Updated last year
- Serverless query engine ☆139 · Updated last year
- Generated Rust of Apache Arrow spec ☆16 · Updated last year
- Ibis Substrait Compiler ☆95 · Updated this week
- Coming soon ☆58 · Updated last year
- Query Plan Markup Language ☆45 · Updated 9 months ago
- Derive for arrow2 ☆65 · Updated last year
- ☆21 · Updated 6 months ago
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible. ☆73 · Updated this week
- HDFS based on Java implementation as a remote ObjectStore for DataFusion ☆9 · Updated 8 months ago
- ☆27 · Updated this week
- Optimizer for DataFusion based on the egg framework ☆12 · Updated 2 years ago