Softlandia-Ltd / stateful-streaming-examplesLinks
Stateful streaming computations implemented with many technologies
☆31Updated last year
Alternatives and similar repositories for stateful-streaming-examples
Users that are interested in stateful-streaming-examples are comparing it to the libraries listed below
Sorting:
- Apache DataFusion Python Bindings☆508Updated last week
- A playground for running duckdb as a stateless query engine over a data lake☆211Updated last year
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆218Updated last week
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated last year
- Distributed SQL Engine in Python using Dask☆407Updated last year
- Work with your web service, database, and streaming schemas in a single format.☆343Updated last month
- SQLAlchemy driver for DuckDB☆464Updated this week
- ☆58Updated last year
- Possibly the fastest DataFrame-agnostic quality check library in town.☆220Updated this week
- Making data lake work for time series☆1,182Updated last year
- Code examples showing flow deployment to various types of infrastructure☆110Updated 2 years ago
- ☆70Updated 9 months ago
- A kafka streams client library built on confluent-kafka-python☆66Updated 2 years ago
- ☆81Updated 7 months ago
- ✨ A Pydantic to PySpark schema library☆107Updated last week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆330Updated 2 years ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆117Updated 2 months ago
- fsspec-compatible Azure Datake and Azure Blob Storage access☆198Updated last week
- Turning PySpark Into a Universal DataFrame API☆437Updated this week
- The Modern Data Stack in a Python package☆49Updated last year
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆245Updated this week
- Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!☆233Updated 8 months ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆259Updated last year
- Metadata tracking and UI service for Metaflow!☆214Updated 5 months ago
- multi-engine batch transformation framework☆448Updated last week
- A data modelling layer built on top of polars and pydantic☆198Updated 2 years ago
- A data modelling layer built on top of polars and pydantic☆548Updated last month
- Joining the modern data stack with the modern ML stack☆200Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- ☆158Updated 4 months ago