larribas / daggerLinks
Define sophisticated data pipelines with Python and run them on different distributed systems (such as Argo Workflows).
☆17Updated last year
Alternatives and similar repositories for dagger
Users that are interested in dagger are comparing it to the libraries listed below
Sorting:
- ☆21Updated last year
- Robust data transformation tool using SQL☆21Updated 2 years ago
- A Minimalistic Rust Implementation of Delta Sharing Server.☆92Updated 5 months ago
- Your go-to for easy access to a plethora of compression algorithms, all neatly bundled in one simple installation.☆118Updated last month
- Apache Arrow Ballista Python bindings☆37Updated last year
- Arrow, pydantic style☆84Updated 2 years ago
- Python binding for DataFusion☆59Updated 3 years ago
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆13Updated 6 months ago
- A fast bloom filter implemented by Rust for Python! 10x faster than pybloom!☆101Updated this week
- Rust DataFusion Server☆20Updated last week
- A robust (🐢) and fast (🐇) MLOps tool for managing data and pipelines in Rust (🦀)☆63Updated 3 months ago
- Apache Arrow Development Experiments☆23Updated 2 months ago
- A Python library to run analytics workloads with the performance of Rust, the flexibility of Python and O(1) cost in moving data between …☆61Updated 4 years ago
- 🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives.☆108Updated this week
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- OpenTelemetry Extensions for Python☆12Updated 10 months ago
- A minimal Python library for Apache Arrow, connecting to the Rust arrow crate☆207Updated this week
- S3 as an ObjectStore for DataFusion☆65Updated 2 years ago
- Postgres protocol frontend for DataFusion☆80Updated this week
- Logging bridge from pyo3 native extension to python☆72Updated 3 months ago
- Official Python client SDK for Iggy.rs message streaming.☆27Updated 2 months ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆164Updated 2 months ago
- Blazing-fast JSON Schema inference engine built in Rust☆83Updated last year
- Example of using the Apache Arrow C Data Interface between Python and Rust☆23Updated last year
- Fluvio DuckDB Integration☆20Updated last year
- Stream processing & Service framework.☆154Updated last year
- Fill Apache Arrow record batches from an ODBC data source in Rust.☆72Updated 2 weeks ago
- Cache the intermediate results of queries on timeseries data in DataFusion.☆18Updated 10 months ago
- An experimental (work-in-progress) statically typed implementation of Apache Arrow☆21Updated this week
- A Delta Lake reader for Dask☆53Updated last month