MrDataPsycho / data-pipelines-in-rust
Data pipeline example written in Rust with the Polars and DataFusion DataFrame packages
☆40 Updated 2 years ago
Alternatives and similar repositories for data-pipelines-in-rust
Users who are interested in data-pipelines-in-rust compare it to the libraries listed below.
- ☆22 Updated 3 years ago
- ☆21 Updated last year
- HDFS based on Java implementation as a remote ObjectStore for DataFusion ☆10 Updated last year
- Back-end implementation of the Open Data Fabric protocol ☆17 Updated this week
- Tantivy directory implementation backed by object_store ☆33 Updated last year
- S3 as an ObjectStore for DataFusion ☆62 Updated 2 years ago
- dbt-databend adapter plugin ☆10 Updated 11 months ago
- Batteries included CLI, TUI, and server implementations for DataFusion. ☆153 Updated this week
- Experimental support for serializing DataFusion plans using Substrait ☆45 Updated 2 years ago
- Fill Apache Arrow record batches from an ODBC data source in Rust. ☆68 Updated 2 weeks ago
- Python binding for DataFusion ☆59 Updated 2 years ago
- Serving any JSON/CSV/Parquet/Arrow files like Postgres tables with DataFusion ☆38 Updated last week
- JSON support for DataFusion (unofficial) ☆40 Updated 2 weeks ago
- A Presto/Trino client library written in Rust. ☆42 Updated 4 months ago
- A text embedding extension for the Polars DataFrame library. ☆24 Updated 5 months ago
- Robust data transformation tool using SQL ☆21 Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆46 Updated last year
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆79 Updated 7 months ago
- ☆91 Updated 2 weeks ago
- A Minimalistic Rust Implementation of Delta Sharing Server. ☆90 Updated last month
- Rust DataFusion Server ☆16 Updated 2 weeks ago
- Apache Spark Connect Client for Rust ☆107 Updated 2 weeks ago
- Databend Native Client ☆57 Updated this week
- Fluvio DuckDB Integration ☆20 Updated last year
- A Python library to run analytics workloads with the performance of Rust, the flexibility of Python and O(1) cost in moving data between … ☆61 Updated 4 years ago
- Rust crate for Substrait: Cross-Language Serialization for Relational Algebra ☆67 Updated last week
- Apache Arrow Ballista Python bindings ☆37 Updated last year
- Rust SDK for Apache Avro - a data serialization system. ☆55 Updated last week
- DataFusion TableProviders for reading data from other systems ☆116 Updated this week
- Rust based high-performance Apache Uniffle shuffle-server ☆29 Updated this week