A Python library for running analytics workloads with the performance of Rust, the flexibility of Python, and O(1) cost for moving data between the two. It uses the Apache Arrow in-memory format and Arrow's query engine, DataFusion.
☆61 · May 6, 2021 · Updated 4 years ago
Alternatives and similar repositories for datafusion-python
Users who are interested in datafusion-python are comparing it to the libraries listed below.
- Generated Rust of Apache Arrow spec ☆17 · Jun 13, 2023 · Updated 2 years ago
- Experimental support for serializing DataFusion plans using substrait ☆46 · Jan 13, 2023 · Updated 3 years ago
- Python binding for DataFusion ☆59 · Jul 22, 2022 · Updated 3 years ago
- Serverless query engine ☆141 · Jan 5, 2023 · Updated 3 years ago
- S3 as an ObjectStore for DataFusion ☆68 · Mar 12, 2023 · Updated 2 years ago
- Rust cloud object storage tools ☆12 · Aug 9, 2021 · Updated 4 years ago
- A reader that buffers ranged calls ☆12 · May 17, 2022 · Updated 3 years ago
- Benchmarks to read parquet to arrow ☆11 · Dec 25, 2022 · Updated 3 years ago
- Robust data transformation tool using SQL ☆22 · Dec 20, 2022 · Updated 3 years ago
- A set of tools for writing servers that speak PostgreSQL's wire protocol ☆93 · Feb 11, 2026 · Updated 2 weeks ago
- Transmute-free Rust library to work with the Arrow format ☆1,069 · Feb 27, 2024 · Updated 2 years ago
- Derive for arrow2 ☆67 · Sep 11, 2023 · Updated 2 years ago
- Experimental optimizer for DataFusion ☆15 · May 29, 2021 · Updated 4 years ago
- ☆19 · Mar 24, 2018 · Updated 7 years ago
- Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow ☆383 · Jul 31, 2024 · Updated last year
- Your go-to for easy access to a plethora of compression algorithms, all neatly bundled in one simple installation. ☆123 · Sep 28, 2025 · Updated 5 months ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame. ☆302 · Feb 4, 2026 · Updated 3 weeks ago
- Batteries included CLI, TUI, and server implementations for DataFusion. ☆189 · Feb 16, 2026 · Updated last week
- Pillars for Transactional Systems and Data Grids ☆133 · Apr 15, 2024 · Updated last year
- Quickly view your data ☆345 · Feb 18, 2026 · Updated last week
- HDFS based on Java implementation as a remote ObjectStore for DataFusion ☆10 · Feb 13, 2024 · Updated 2 years ago
- Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core. ☆11 · Apr 23, 2022 · Updated 3 years ago
- Make RocksDB really rocks! The Rust style API. ☆47 · Apr 26, 2021 · Updated 4 years ago
- Mcts library written in rust, for rust. ☆14 · Oct 8, 2023 · Updated 2 years ago
- A minimal in-memory database with relational algebraic expressions as queries ☆62 · Aug 23, 2021 · Updated 4 years ago
- Optimizer for DataFusion based on the egg framework ☆15 · Mar 17, 2022 · Updated 3 years ago
- NumPy file format (de-)serialization in Rust ☆30 · Jul 15, 2019 · Updated 6 years ago
- Experimental WebAssembly backend to MindSpore. ☆58 · Jul 29, 2020 · Updated 5 years ago
- A rust implementation of a key-value store with Log-Structured Merge Trees. ☆15 · Nov 12, 2021 · Updated 4 years ago
- Spark VCF data source implementation for Dataframes ☆15 · Jul 15, 2022 · Updated 3 years ago
- Simple end-to-end encryption for webapps ☆16 · Jan 20, 2025 · Updated last year
- A column-oriented, dataframe implementation for Racket. ☆17 · Mar 30, 2025 · Updated 11 months ago
- ☆15 · Apr 1, 2021 · Updated 4 years ago
- Lifecycle helpers for loading and unmounting css ☆15 · Jun 19, 2025 · Updated 8 months ago
- A Rust DataFrame implementation, built on Apache Arrow ☆280 · Oct 26, 2020 · Updated 5 years ago
- Collaborative notepad app that is based on CRDT's ☆17 · Nov 27, 2017 · Updated 8 years ago
- general functions for your data .pipe()-lines. ☆17 · Nov 8, 2023 · Updated 2 years ago
- The hot new standard in open databases ☆30 · Sep 13, 2020 · Updated 5 years ago
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL). ☆20 · Feb 10, 2025 · Updated last year