Softlandia-Ltd / stateful-streaming-examples
Stateful streaming computations implemented with many technologies
☆31 Updated 6 months ago
Related projects
Alternatives and complementary repositories for stateful-streaming-examples
- IbisML is a library for building scalable ML pipelines using Ibis. ☆95 Updated last month
- Delta reader for the Ray open-source toolkit for building ML applications ☆43 Updated 9 months ago
- ☆54 Updated 10 months ago
- Apache DataFusion Python Bindings ☆376 Updated this week
- A kafka streams client library built on confluent-kafka-python ☆68 Updated last year
- The Modern Data Stack in a Python package ☆49 Updated 11 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆111 Updated 7 months ago
- Dask integration for Snowflake ☆30 Updated last week
- PySpark schema generator ☆38 Updated last year
- Prefect integrations for working with Docker ☆43 Updated 6 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆53 Updated 2 months ago
- ☆83 Updated 6 months ago
- The smallest DuckDB SQL orchestrator on Earth. ☆177 Updated 2 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆189 Updated this week
- ☆76 Updated last month
- A Minimalistic Rust Implementation of Delta Sharing Server. ☆81 Updated 3 months ago
- A playground for running duckdb as a stateless query engine over a data lake ☆171 Updated 10 months ago
- ✨ A Pydantic to PySpark schema library ☆56 Updated this week
- A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture t… ☆165 Updated 2 weeks ago
- FlockMTL: DuckDB extension to seamlessly combine analytics and semantic analysis using language models (LMs) ☆67 Updated this week
- ☆67 Updated 2 weeks ago
- ☆16 Updated 11 months ago
- Synchronicity lets you interoperate with asynchronous Python APIs. ☆84 Updated 2 weeks ago
- Deploy a Prefect flow to serverless AWS Lambda function ☆36 Updated 2 years ago
- Arrow, pydantic style ☆82 Updated last year
- ☆26 Updated last year
- A write-audit-publish implementation on a data lake without the JVM ☆41 Updated 3 months ago
- LETSQL is a deferred compute system focused on Preprocessing for AI pipelines. Optimize performance with cross-engine caching and static … ☆68 Updated this week
- Read Apache Arrow batches from ODBC data sources in Python ☆58 Updated 3 weeks ago
- Python stream processing for analytics ☆24 Updated this week