lakehq / sail
LakeSail's computation framework with a mission to unify stream processing, batch processing, and compute-intensive (AI) workloads.
☆601 · Updated this week
Alternatives and similar repositories for sail:
Users who are interested in sail are comparing it to the libraries listed below:
- Apache Iceberg ☆778 · Updated this week
- An extensible, state-of-the-art columnar file format ☆1,070 · Updated this week
- Quickly view your data ☆292 · Updated last week
- GlareDB: An analytics DBMS for distributed data ☆752 · Updated this week
- Analytical database for data-driven Web applications 🪶 ☆458 · Updated this week
- A native Delta implementation for integration with any query engine ☆174 · Updated this week
- Lakekeeper: A Rust native Iceberg REST Catalog ☆377 · Updated this week
- Apache DataFusion Python Bindings ☆400 · Updated this week
- Apache DataFusion Comet Spark Accelerator ☆866 · Updated this week
- Apache DataFusion Ray ☆139 · Updated 3 weeks ago
- Rust implementation of Apache Iceberg with integration for Datafusion ☆133 · Updated this week
- DuckDB for streaming data ☆255 · Updated this week
- DuckDB extension for Delta Lake ☆152 · Updated this week
- Boring Data Tool ☆213 · Updated 9 months ago
- A collection of RBIR projects and posts for anyone interested in joining this journey. ☆210 · Updated this week
- New file format for storage of large columnar datasets. ☆464 · Updated this week
- The native Rust implementation for Apache Hudi, with Python API bindings. ☆185 · Updated this week
- Embeddable stream processing engine based on Apache DataFusion ☆308 · Updated last month
- PRQL as a DuckDB extension ☆272 · Updated 4 months ago
- Turning PySpark Into a Universal DataFrame API ☆349 · Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆315 · Updated last year
- Apache DataFusion Ballista Distributed Query Engine ☆1,618 · Updated this week
- Columnstore Table in Postgres ☆372 · Updated this week
- The Feldera Incremental Computation Engine ☆978 · Updated this week
- High-performance diffing of large datasets across databases ☆383 · Updated this week
- Ergonomic bindings to duckdb for Rust ☆543 · Updated last week
- A highly efficient daemon for streaming data from Kafka into Delta Lake ☆379 · Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆76 · Updated 3 months ago
- Database connectivity API standard and libraries for Apache Arrow ☆393 · Updated this week
- Maelstrom is a fast Rust, Go, and Python test runner that runs every test in its own container. Tests are either run locally or distribut… ☆609 · Updated this week