MrDataPsycho / data-pipelines-in-rust
Data pipeline example written in Rust with Polars and DataFusion DataFrame package
☆40Updated 2 years ago
Alternatives and similar repositories for data-pipelines-in-rust:
Users that are interested in data-pipelines-in-rust are comparing it to the libraries listed below
- ☆21Updated 11 months ago
- Fill Apache Arrow record batches from an ODBC data source in Rust.☆67Updated last week
- ☆22Updated 3 years ago
- S3 as an ObjectStore for DataFusion☆61Updated 2 years ago
- JSON support for DataFusion (unofficial)☆38Updated last week
- A Minimalistic Rust Implementation of Delta Sharing Server.☆89Updated last month
- Robust data transformation tool using SQL☆21Updated 2 years ago
- Apache Arrow Ballista Python bindings☆37Updated last year
- Python binding for DataFusion☆59Updated 2 years ago
- Experimental support for serializing DataFusion plans using substrait☆45Updated 2 years ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆149Updated last week
- A text embedding extension for the Polars Dataframe library.☆24Updated 5 months ago
- Tantivy directory implementation backed by object_store☆32Updated last year
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆11Updated 2 months ago
- Rust SDK for Apache Avro - a data serialization system.☆55Updated last week
- ☆18Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- Open, Multi-modal Catalog for Data & AI, written in Rust☆78Updated 6 months ago
- ☆89Updated 3 weeks ago
- Journeys between the two worlds of Python 🐍 and Rust 🦀☆40Updated this week
- DataFusion TableProviders for reading data from other systems☆105Updated this week
- dbt-databend adapter plugin☆10Updated 10 months ago
- A DataFusion-powered Serverless S3 Proxy.☆16Updated last year
- HDFS based on Java implementation as a remote ObjectStore for DataFusion☆10Updated last year
- Rust DataFusion Server☆16Updated last month
- Serving any JSON/CSN/Parquet/Arrow files like Postgres tables with Datafusion☆30Updated last week
- A Rust based deduplication tool☆34Updated 3 months ago
- The (B)ig (F)unction (T)axonomy is a detailed reference for common compute functions executed by different libraries, databases, and tool…☆16Updated 4 months ago
- A presto/trino client library written in rust.☆41Updated 4 months ago
- Framework to build data pipelines declaratively☆50Updated last month