Softlandia-Ltd / stateful-streaming-examples
Stateful streaming computations implemented with many technologies
☆32 · Updated last year
Alternatives and similar repositories for stateful-streaming-examples
Users who are interested in stateful-streaming-examples are comparing it to the libraries listed below.
- Apache DataFusion Python Bindings ☆530 · Updated last week
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆114 · Updated 3 weeks ago
- ✨ A Pydantic to PySpark schema library ☆112 · Updated this week
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆229 · Updated last month
- Delta Acceptance Testing ☆21 · Updated 3 months ago
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀 ☆106 · Updated 3 weeks ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆45 · Updated last year
- Distributed SQL Engine in Python using Dask ☆408 · Updated last year
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you… ☆255 · Updated this week
- Work with your web service, database, and streaming schemas in a single format. ☆346 · Updated 3 months ago
- A playground for running duckdb as a stateless query engine over a data lake ☆214 · Updated last year
- A kafka streams client library built on confluent-kafka-python ☆66 · Updated 2 years ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆222 · Updated 2 weeks ago
- ☆70 · Updated 11 months ago
- IbisML is a library for building scalable ML pipelines using Ibis. ☆118 · Updated 4 months ago
- fsspec-compatible Azure Datalake and Azure Blob Storage access ☆201 · Updated 2 weeks ago
- Read Delta tables without any Spark ☆47 · Updated last year
- A data modelling layer built on top of polars and pydantic ☆198 · Updated 2 years ago
- high-level expressions for multi-engine compute ☆462 · Updated this week
- A high-performance data streaming system using DuckDB and Apache Arrow Flight. ☆94 · Updated 9 months ago
- The Modern Data Stack in a Python package ☆49 · Updated 2 years ago
- Turning PySpark Into a Universal DataFrame API ☆457 · Updated last week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆332 · Updated 2 years ago
- Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary! ☆235 · Updated 10 months ago
- Making data lake work for time series ☆1,185 · Updated last year
- Stream Arrow data into Postgres ☆274 · Updated 4 months ago
- PySpark schema generator ☆43 · Updated 2 years ago
- SQLAlchemy driver for DuckDB ☆474 · Updated this week
- ☆81 · Updated 9 months ago
- Ray integration for Dagster ☆61 · Updated this week