danielgafni / dagster-ray
Ray integration for Dagster
☆36Updated this week
Alternatives and similar repositories for dagster-ray:
Users that are interested in dagster-ray are comparing it to the libraries listed below
- Arrow, pydantic style☆82Updated 2 years ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆106Updated 2 months ago
- Coming soon☆60Updated last year
- A data modelling layer built on top of polars and pydantic☆195Updated last year
- [Project moved] Polars integration for Dagster☆36Updated last year
- Polars plugin for stable hashing functionality☆66Updated 3 months ago
- Write your dbt models using Ibis☆64Updated this week
- Ray provider for Apache Airflow☆47Updated last year
- A playground for running duckdb as a stateless query engine over a data lake☆190Updated last year
- An fsspec implementation for the lakeFS project☆46Updated last week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆198Updated this week
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- dbt-prql allows writing PRQL in dbt models☆104Updated 2 weeks ago
- Dagster SQLMesh Adapter☆44Updated this week
- 🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives.☆101Updated this week
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆116Updated last month
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆139Updated last month
- Synchronicity lets you interoperate with asynchronous Python APIs.☆106Updated last month
- deferred computational framework for multi-engine pipelines☆105Updated this week
- Work with your web service, database, and streaming schemas in a single format.☆344Updated 11 months ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆43Updated this week
- A write-audit-publish implementation on a data lake without the JVM☆46Updated 7 months ago
- A Minimalistic Rust Implementation of Delta Sharing Server.☆88Updated this week
- Python binding for DataFusion☆59Updated 2 years ago
- Dask integration for Snowflake☆30Updated 4 months ago
- The easiest way to integrate Kedro and Great Expectations☆53Updated 2 years ago
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆23Updated last year
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated 11 months ago
- Apache DataFusion Python Bindings☆423Updated this week