Softlandia-Ltd / stateful-streaming-examples
Stateful streaming computations implemented with many technologies
☆31 Updated 11 months ago
Alternatives and similar repositories for stateful-streaming-examples:
Users who are interested in stateful-streaming-examples are comparing it to the libraries listed below.
- ☆57 Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications ☆45 Updated last year
- A playground for running duckdb as a stateless query engine over a data lake ☆193 Updated last year
- 🚀 Stream inferences of real-time ML models in production to any data lake (Experimental) ☆80 Updated 2 years ago
- Apache DataFusion Python Bindings ☆433 Updated this week
- Ray provider for Apache Airflow ☆48 Updated last year
- ByteHub: making feature stores simple ☆60 Updated 3 years ago
- A kafka streams client library built on confluent-kafka-python ☆67 Updated last year
- A write-audit-publish implementation on a data lake without the JVM ☆46 Updated 7 months ago
- IbisML is a library for building scalable ML pipelines using Ibis. ☆108 Updated 3 months ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆201 Updated this week
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 Updated last year
- ✨ A Pydantic to PySpark schema library ☆81 Updated this week
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner ☆71 Updated 3 years ago
- ☆68 Updated 3 months ago
- Prepare requirements and deploy Flyte using Helm ☆66 Updated last month
- Arrow, pydantic style ☆82 Updated 2 years ago
- Magniv Core - A Python-decorator based job orchestration platform. Avoid responsibility handoffs by abstracting infra and DevOps. ☆78 Updated 8 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆212 Updated this week
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe) ☆70 Updated last month
- deferred computational framework for multi-engine pipelines ☆220 Updated this week
- DVC support for Airflow workflows ☆6 Updated 2 years ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆53 Updated 7 months ago
- A Table format agnostic data sharing framework ☆38 Updated last year
- MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code. ☆100 Updated last week
- PySpark schema generator ☆42 Updated 2 years ago
- Dask integration for Snowflake ☆30 Updated 4 months ago
- Deploy production-grade Metaflow cloud infrastructure on AWS ☆65 Updated 3 months ago
- Ray-based Apache Beam runner ☆43 Updated last year
- Ray integration for Dagster ☆37 Updated last week