sjrusso8 / spark-connect-rs
Apache Spark Connect Client for Rust
☆90Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for spark-connect-rs
- Open, Multi-modal Catalog for Data & AI, written in Rust☆74Updated last month
- Apache DataFusion Ray☆118Updated this week
- A native Delta implementation for integration with any query engine☆146Updated this week
- Rust implementation of Apache Iceberg with integration for Datafusion☆109Updated this week
- A native Rust library for Apache Hudi, with bindings into Python☆149Updated this week
- Lakekeeper: A Rust native Iceberg REST Catalog☆235Updated this week
- A Minimalistic Rust Implementation of Delta Sharing Server.☆82Updated 3 months ago
- ☆32Updated this week
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆370Updated last week
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆192Updated this week
- Pure Rust Iceberg Implementation☆166Updated 3 months ago
- ☆21Updated 2 years ago
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆77Updated last week
- Pythonic Iceberg REST Catalog☆67Updated 2 months ago
- Apache Iceberg☆674Updated this week
- Experimental support for serializing DataFusion plans using substrait☆44Updated last year
- An opinionated and batteries included DataFusion implementation.☆115Updated this week
- DataFusion TableProviders for reading data from other systems☆62Updated this week
- Apache Paimon Rust The rust implementation of Apache Paimon.☆100Updated last month
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆37Updated 2 months ago
- ☆33Updated last year
- Boring Data Tool☆210Updated 8 months ago
- LakeSail's computation framework with a mission to unify stream processing, batch processing, and compute-intensive (AI) workloads.☆447Updated this week
- ☆26Updated 5 months ago
- ☆160Updated last month
- Delta reader for the Ray open-source toolkit for building ML applications☆43Updated 9 months ago
- A purely experimental DuckDB Deltalake extension☆94Updated 2 weeks ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Updated 5 months ago
- Apache DataFusion Python Bindings☆376Updated last week