danielgafni / dagster-ray
Ray integration for Dagster
☆23, updated this week
Related projects
Alternatives and complementary repositories for dagster-ray
- A data modelling layer built on top of polars and pydantic (☆197, updated last year)
- [Project moved] Polars integration for Dagster (☆37, updated 8 months ago)
- ☆152, updated 3 weeks ago
- DuckDB extension for Delta Lake (☆136, updated this week)
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples (☆50, updated this week)
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. (☆205, updated last month)
- A playground for running duckdb as a stateless query engine over a data lake (☆168, updated 10 months ago)
- IbisML is a library for building scalable ML pipelines using Ibis. (☆93, updated last month)
- A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture t… (☆162, updated last week)
- Write your dbt models using Ibis (☆52, updated 3 weeks ago)
- A purely experimental DuckDB Deltalake extension (☆94, updated this week)
- Work with your web service, database, and streaming schemas in a single format. (☆330, updated 7 months ago)
- Turning PySpark Into a Universal DataFrame API (☆318, updated last week)
- A native Delta implementation for integration with any query engine (☆143, updated this week)
- Arrow, pydantic style (☆82, updated last year)
- Apache DataFusion Python Bindings (☆375, updated this week)
- Read Apache Arrow batches from ODBC data sources in Python (☆57, updated 2 weeks ago)
- A Postgres Proxy Server in Python (☆252, updated last month)
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu… (☆55, updated last year)
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg (☆304, updated last year)
- Polars plugin for stable hashing functionality (☆57, updated last week)
- LETSQL is a deferred compute system focused on Preprocessing for AI pipelines. Optimize performance with cross-engine caching and static … (☆67, updated this week)
- Pythonic Iceberg REST Catalog (☆66, updated 2 months ago)
- The Modern Data Stack in a Python package (☆49, updated 11 months ago)
- dbt's adapter for dremio (☆48, updated 2 years ago)
- The smallest DuckDB SQL orchestrator on Earth. (☆171, updated last month)
- Apache DataFusion Ray (☆110, updated last week)
- Open, Multi-modal Catalog for Data & AI, written in Rust (☆73, updated last month)
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily (☆81, updated this week)
- ☆82, updated 6 months ago