jorgecarleitao / datafusion-pythonLinks
A Python library to run analytics workloads with the performance of Rust, the flexibility of Python and O(1) cost in moving data between the two. Uses Apache Arrow in-memory format and respective query engine DataFusion.
☆61Updated 4 years ago
Alternatives and similar repositories for datafusion-python
Users that are interested in datafusion-python are comparing it to the libraries listed below
Sorting:
- Experimental support for serializing DataFusion plans using substrait☆46Updated 3 years ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆188Updated last week
- Python binding for DataFusion☆59Updated 3 years ago
- Arrow, pydantic style☆85Updated 3 years ago
- S3 as an ObjectStore for DataFusion☆67Updated 2 years ago
- Fill Apache Arrow record batches from an ODBC data source in Rust.☆75Updated 2 months ago
- Robust data transformation tool using SQL☆22Updated 3 years ago
- Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow☆383Updated last year
- Serverless query engine☆141Updated 3 years ago
- Derive for arrow2☆67Updated 2 years ago
- Rust crate for Substrait: Cross-Language Serialization for Relational Algebra☆85Updated last week
- ☆55Updated last year
- Incremental view maintenance & query rewriting for materialized views in DataFusion☆67Updated last week
- Rust implementation of the FastLanes compression library☆160Updated last week
- Generated Rust of Apache Arrow spec☆17Updated 2 years ago
- HDFS based on Java implementation as a remote ObjectStore for DataFusion☆10Updated last year
- A set of tools for writing servers that speak PostgreSQL's wire protocol☆93Updated last week
- An experimental (work-in-progress) statically typed implementation of Apache Arrow☆28Updated last week
- results cache for Apache DataFusion☆32Updated last year
- Boring Data Tool☆239Updated last year
- Query Plan Markup Language☆45Updated 2 years ago
- ☆22Updated last year
- Postgres protocol frontend for DataFusion☆126Updated last week
- A purely experimental DuckDB Deltalake extension☆95Updated this week
- A modular implementation of timely dataflow in Rust☆119Updated 2 weeks ago
- ☆19Updated 7 years ago
- Example of using the Apache Arrow C Data Interface between Python and Rust☆24Updated last year
- A reader that buffers ranged calls☆12Updated 3 years ago
- Rust lib to read from Apache ORC☆18Updated 2 years ago
- JSON support for DataFusion (unofficial)☆55Updated 2 weeks ago