apache / datafusion-python
Apache DataFusion Python Bindings
☆459 Updated this week
Alternatives and similar repositories for datafusion-python
Users who are interested in datafusion-python are comparing it to the libraries listed below.
- Database connectivity API standard and libraries for Apache Arrow ☆458 Updated this week
- A native Delta implementation for integration with any query engine ☆234 Updated this week
- Apache DataFusion Ray ☆202 Updated 2 months ago
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings. ☆223 Updated last week
- ☆284 Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆255 Updated 8 months ago
- Distributed SQL Engine in Python using Dask ☆405 Updated 9 months ago
- Apache Iceberg ☆990 Updated this week
- Quickly view your data ☆315 Updated last week
- DuckDB extension for Delta Lake ☆192 Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆79 Updated 8 months ago
- Turning PySpark Into a Universal DataFrame API ☆407 Updated last week
- Unofficial Rust implementation of Apache Iceberg with integration for DataFusion ☆200 Updated this week
- Declarative, multi-engine data framework ☆288 Updated this week
- Ibis Substrait Compiler ☆103 Updated this week
- Apache PyIceberg ☆782 Updated this week
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust. ☆734 Updated this week
- Apache DataFusion Comet Spark Accelerator ☆971 Updated this week
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive AI workloads. ☆788 Updated this week
- Boring Data Tool ☆223 Updated last year
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆330 Updated 2 years ago
- Stream Arrow data into Postgres ☆265 Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆222 Updated this week
- ☆292 Updated this week
- Distributed SQL Query Engine in Python using Ray ☆243 Updated 8 months ago
- SQLAlchemy driver for DuckDB ☆431 Updated this week
- A Postgres Proxy Server in Python ☆285 Updated 6 months ago
- Python bindings for sqlparser-rs ☆191 Updated last month
- Read Apache Arrow batches from ODBC data sources in Python ☆65 Updated 3 weeks ago
- A playground for running DuckDB as a stateless query engine over a data lake ☆202 Updated last year