MrDataPsycho / data-pipelines-in-rust
Data pipeline example written in Rust with Polars and DataFusion DataFrame package
☆39Updated last year
Related projects ⓘ
Alternatives and complementary repositories for data-pipelines-in-rust
- JSON support for DataFusion (unofficial)☆28Updated last week
- ☆21Updated 6 months ago
- HDFS based on Java implementation as a remote ObjectStore for DataFusion☆9Updated 8 months ago
- ☆21Updated 2 years ago
- Experimental support for serializing DataFusion plans using substrait☆44Updated last year
- A batteries included data processing and DataFusion development app for the terminal☆111Updated this week
- Rust lib to read from Apache ORC☆18Updated last year
- Tantivy directory implementation backed by object_store☆27Updated 9 months ago
- Fill Apache Arrow record batches from an ODBC data source in Rust.☆50Updated this week
- Robust data transformation tool using SQL☆20Updated last year
- Python binding for DataFusion☆59Updated 2 years ago
- Rust crate for Substrait: Cross-Language Serialization for Relational Algebra☆58Updated this week
- µWheel DataFusion Optimizer for speeding up time-based analytics☆26Updated 2 months ago
- S3 as an ObjectStore for DataFusion☆59Updated last year
- Rust FFI example binding for chDB, an embedded SQL Engine powered by ClickHouse☆29Updated 2 months ago
- A Minimalistic Rust Implementation of Delta Sharing Server.☆81Updated 3 months ago
- Apache Arrow Ballista Python bindings☆33Updated 8 months ago
- A presto/trino client library written in rust.☆40Updated 3 weeks ago
- ☆76Updated last month
- Rust DataFusion Server☆11Updated 2 weeks ago
- memchr vs stringzilla - up to 7x throughput difference between two SIMD-accelerated substring search libraries in Rust☆45Updated 6 months ago
- Make ETLs Great Again!☆42Updated last year
- dbt-databend adapter plugin☆10Updated 5 months ago
- Apache Spark Connect Client for Rust☆90Updated last week
- Derive for arrow2☆65Updated last year
- A workflow scheduler based on petri-nets☆68Updated last month
- Rust SDK for Apache Avro - a data serialization system.☆21Updated this week
- A User-Defined Function Framework for Apache Arrow.☆76Updated last week
- Bigtable data source for Apache Arrow DataFusion☆23Updated 2 years ago