Softlandia-Ltd / stateful-streaming-examplesLinks
Stateful streaming computations implemented with many technologies
☆31Updated last year
Alternatives and similar repositories for stateful-streaming-examples
Users that are interested in stateful-streaming-examples are comparing it to the libraries listed below
Sorting:
- A kafka streams client library built on confluent-kafka-python☆67Updated last year
- PySpark schema generator☆43Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- ☆58Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- A Minimalistic Rust Implementation of Delta Sharing Server.☆92Updated 3 months ago
- Ray provider for Apache Airflow☆48Updated last year
- A Table format agnostic data sharing framework☆38Updated last year
- Build reliable AI and agentic applications with DataFrames☆73Updated this week
- Delta Acceptance Testing☆20Updated 11 months ago
- Apache DataFusion Python Bindings☆460Updated this week
- A playground for running duckdb as a stateless query engine over a data lake☆206Updated last year
- ☆70Updated 5 months ago
- ☆33Updated last year
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆82Updated 4 months ago
- 🚀 Stream inferences of real-time ML models in production to any data lake (Experimental)☆81Updated 3 years ago
- Ray integration for Dagster☆45Updated this week
- ✨ A Pydantic to PySpark schema library☆96Updated this week
- Read Delta tables without any Spark☆47Updated last year
- This library can convert a pydantic class to a avro schema or generate python code from a avro schema.☆76Updated last month
- The Modern Data Stack in a Python package☆49Updated last year
- Work with your web service, database, and streaming schemas in a single format.☆343Updated last week
- Dask integration for Snowflake☆30Updated 7 months ago
- Ray-based Apache Beam runner☆42Updated last year
- Coming soon☆61Updated last year
- Schema modelling framework for decentralised domain-driven ownership of data.☆254Updated last year
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆218Updated last week
- A write-audit-publish implementation on a data lake without the JVM☆46Updated 10 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆188Updated this week
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆43Updated this week