MrDataPsycho / data-pipelines-in-rustLinks
Data pipeline example written in Rust with Polars and DataFusion DataFrame package
☆41Updated 2 years ago
Alternatives and similar repositories for data-pipelines-in-rust
Users that are interested in data-pipelines-in-rust are comparing it to the libraries listed below
Sorting:
- ☆21Updated last year
- Tantivy directory implementation backed by object_store☆36Updated last year
- dbt-databend adapter plugin☆10Updated last year
- Open, Multi-modal Catalog for Data & AI, written in Rust☆81Updated 11 months ago
- ☆105Updated 2 weeks ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆164Updated 2 months ago
- Fill Apache Arrow record batches from an ODBC data source in Rust.☆72Updated last week
- ☆23Updated 3 years ago
- Python binding for DataFusion☆59Updated 3 years ago
- DataFusion TableProviders for reading data from other systems☆142Updated this week
- Experimental support for serializing DataFusion plans using substrait☆45Updated 2 years ago
- Robust data transformation tool using SQL☆21Updated 2 years ago
- A Minimalistic Rust Implementation of Delta Sharing Server.☆92Updated 6 months ago
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆143Updated 3 weeks ago
- Example of using the Apache Arrow C Data Interface between Python and Rust☆23Updated last year
- S3 as an ObjectStore for DataFusion☆65Updated 2 years ago
- JSON support for DataFusion (unofficial)☆47Updated last month
- Rust crate for Substrait: Cross-Language Serialization for Relational Algebra☆76Updated this week
- Postgres protocol frontend for DataFusion☆82Updated this week
- Databend Native Client☆59Updated last week
- Framework to build data pipelines declaratively☆92Updated this week
- Apache Spark Connect Client for Rust☆112Updated 3 months ago
- Minimal, exact vector search with metadata filtering. Think "Polars for vector search."☆28Updated last week
- HDFS based on Java implementation as a remote ObjectStore for DataFusion☆10Updated last year
- Rust DataFusion Server☆20Updated 3 weeks ago
- A Python library to run analytics workloads with the performance of Rust, the flexibility of Python and O(1) cost in moving data between …☆61Updated 4 years ago
- WASM bindings for DataFusion☆26Updated 4 months ago
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆13Updated 7 months ago
- Embeddable Aggregate Management System for Streams and Queries.☆96Updated 4 months ago
- Apache Arrow Ballista Python bindings☆37Updated last year