Softlandia-Ltd / stateful-streaming-examples
Stateful streaming computations implemented with many technologies
☆31Updated 9 months ago
Alternatives and similar repositories for stateful-streaming-examples:
Users that are interested in stateful-streaming-examples are comparing it to the libraries listed below
- ☆54Updated last year
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆112Updated 10 months ago
- This library can convert a pydantic class to a avro schema or generate python code from a avro schema.☆68Updated 2 weeks ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆199Updated last week
- IbisML is a library for building scalable ML pipelines using Ibis.☆100Updated last month
- A playground for running duckdb as a stateless query engine over a data lake☆184Updated last year
- ☆67Updated last week
- A kafka streams client library built on confluent-kafka-python☆67Updated last year
- ✨ A Pydantic to PySpark schema library☆69Updated this week
- ☆68Updated last month
- Deploy production-grade Metaflow cloud infrastructure on AWS☆61Updated last month
- Code examples showing flow deployment to various types of infrastructure☆104Updated 2 years ago
- Ray integration for Dagster☆36Updated this week
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 5 months ago
- Read Delta tables without any Spark☆47Updated 11 months ago
- Kedro Plugin to support running workflows on GCP Vertex AI Pipelines☆35Updated 2 weeks ago
- 🚀 Stream inferences of real-time ML models in production to any data lake (Experimental)☆79Updated 2 years ago
- Minimal example to run Trino, Minio, and Hive standalone metastore on docker☆49Updated 2 years ago
- A Table format agnostic data sharing framework☆38Updated last year
- Prepare requirements and deploy Flyte using Helm☆62Updated last month
- Dask integration for Snowflake☆30Updated 3 months ago
- Apache DataFusion Python Bindings☆414Updated this week
- Ray provider for Apache Airflow☆47Updated last year
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- PySpark schema generator☆41Updated last year
- A write-audit-publish implementation on a data lake without the JVM☆46Updated 6 months ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆192Updated this week
- Possibly the fastest DataFrame-agnostic quality check library in town.☆181Updated last week