apache / datafusion-python
Apache DataFusion Python Bindings
☆375Updated this week
Related projects ⓘ
Alternatives and complementary repositories for datafusion-python
- Database connectivity API standard and libraries for Apache Arrow☆384Updated this week
- A native Delta implementation for integration with any query engine☆144Updated this week
- Distributed SQL Engine in Python using Dask☆397Updated 2 months ago
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆205Updated last month
- Apache PyIceberg☆473Updated this week
- Apache DataFusion Ray☆116Updated this week
- Apache Iceberg☆658Updated this week
- ☆159Updated last month
- Distributed SQL Query Engine in Python using Ray☆239Updated last month
- Ibis Substrait Compiler☆95Updated this week
- ☆242Updated 2 months ago
- Quickly view your data☆283Updated this week
- Boring Data Tool☆209Updated 7 months ago
- A native Rust library for Apache Hudi, with bindings into Python☆146Updated this week
- DuckDB extension for Delta Lake☆136Updated last week
- A purely experimental DuckDB Deltalake extension☆94Updated 2 weeks ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆74Updated last month
- Turning PySpark Into a Universal DataFrame API☆323Updated this week
- Apache DataFusion Comet Spark Accelerator☆821Updated this week
- LakeSail's computation framework with a mission to unify stream processing, batch processing, and compute-intensive (AI) workloads.☆375Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆306Updated last year
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆148Updated last week
- Apache DataFusion Ballista Distributed Query Engine☆1,549Updated this week
- Stream Arrow data into Postgres☆251Updated 7 months ago
- A command line tool to query an ODBC data source and write the result into a parquet file.☆223Updated this week
- Read Apache Arrow batches from ODBC data sources in Python☆58Updated 3 weeks ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,205Updated this week
- Lakekeeper: A Rust native Iceberg REST Catalog☆234Updated this week
- Python binding for DataFusion☆59Updated 2 years ago
- A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture t…☆165Updated 2 weeks ago