apache / spark-connect-rust
Apache Spark Connect for Rust
☆23Updated 3 weeks ago
Alternatives and similar repositories for spark-connect-rust
Users who are interested in spark-connect-rust are comparing it to the libraries listed below
- Apache Spark Connect Client for Rust☆114Updated 4 months ago
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆414Updated 5 months ago
- A native Delta implementation for integration with any query engine☆271Updated last week
- Compaction runtime for Apache Iceberg.☆100Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆82Updated last year
- Unofficial Rust implementation of Apache Iceberg with integration for DataFusion☆221Updated last week
- Boring Data Tool☆235Updated last year
- Query Plan Markup Language☆45Updated last year
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆255Updated last week
- ☆59Updated 2 weeks ago
- Apache DataFusion Benchmarks☆22Updated 3 weeks ago
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆43Updated last year
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆294Updated this week
- Incremental view maintenance & query rewriting for materialized views in DataFusion☆53Updated this week
- Apache DataFusion Ray☆221Updated 3 weeks ago
- A Minimalistic Rust Implementation of Delta Sharing Server.☆95Updated 7 months ago
- DataFusion TableProviders for reading data from other systems☆155Updated this week
- Batteries included CLI, TUI, and server implementations for DataFusion.☆165Updated last week
- A BYOC option for Snowflake workloads☆101Updated this week
- TPC-H benchmark data generation in pure Rust☆202Updated last month
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive AI workloads.☆1,018Updated this week
- Apache DataFusion Comet Spark Accelerator☆1,055Updated last week
- ☆107Updated 2 years ago
- CLI tool to bulk migrate tables from one catalog to another without a data copy☆81Updated 6 months ago
- ☆14Updated 3 weeks ago
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible.☆151Updated last month
- Fill Apache Arrow record batches from an ODBC data source in Rust.☆74Updated this week
- Distributed SQL Query Engine in Python using Ray☆246Updated last year
- A library that provides useful extensions to Apache Spark and PySpark.☆231Updated 3 months ago
- Experimental support for serializing DataFusion plans using substrait☆46Updated 2 years ago