risingwavelabs / awesome-stream-processing
A collection of demos showcasing how stream processing can be used to solve real-world problems.
☆188Updated last week
Alternatives and similar repositories for awesome-stream-processing:
Users that are interested in awesome-stream-processing are comparing it to the libraries listed below
- Embeddable stream processing engine based on Apache DataFusion☆333Updated 4 months ago
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆235Updated this week
- DuckDB for streaming data☆466Updated this week
- 10x lower latency for cloud-native DataFusion☆134Updated this week
- Apache DataFusion Ray☆184Updated 3 weeks ago
- DuckDB-powered analytics in Postgres☆152Updated 10 months ago
- In-Memory Analytics for Kafka using DuckDB☆116Updated this week
- View parquet files online☆148Updated last week
- Embeddable Cloud-Native Key-Value Storage.☆92Updated 2 months ago
- New file format for storage of large columnar datasets.☆532Updated this week
- Rust implementation of Apache Iceberg with integration for Datafusion☆166Updated 2 weeks ago
- Message queue and data streaming based on cloud native services.☆109Updated this week
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆239Updated 2 weeks ago
- An in-process Parquet merge engine for better data warehousing in S3 with MVCC☆144Updated 3 months ago
- Pure Rust Iceberg Implementation☆163Updated 8 months ago
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive (AI) workloads.☆727Updated this week
- TPC-H benchmark data generation in pure Rust☆53Updated this week
- The native Rust implementation for Apache Hudi, with Python API bindings.☆209Updated this week
- CMU-DB's Cascades optimizer framework☆397Updated 3 months ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆78Updated 6 months ago
- Analytical database for data-driven Web applications 🪶☆482Updated 2 months ago
- deferred computational framework for multi-engine pipelines☆242Updated this week
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17☆54Updated 11 months ago
- Multi-hop declarative data pipelines☆114Updated this week
- A native Delta implementation for integration with any query engine☆223Updated this week
- GigAPI: DuckDB + Parquet Query Engine & Cloud-Native Storage API☆228Updated this week
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆124Updated last month
- A Benchmark for Real-Time Analytics Applications☆61Updated 2 weeks ago
- Unified MySQL, Postgres & FlightSQL Server, Powered by DuckDB.☆434Updated 3 months ago
- Apache Paimon Rust The rust implementation of Apache Paimon.☆117Updated this week