kwai / auronLinks

Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.

☆1,528

Alternatives and similar repositories for auron

Users that are interested in auron are comparing it to the libraries listed below

Sorting:

apache / datafusion-comet
Apache DataFusion Comet Spark Accelerator
☆1,028Updated this week
apache / incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
☆1,419Updated this week
apache / iceberg-rust
Apache Iceberg
☆1,059Updated this week
apache / datafusion-ballista
Apache DataFusion Ballista Distributed Query Engine
☆1,821Updated this week
substrait-io / substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
☆1,368Updated last week
linkedin / coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
☆854Updated last month
apache / celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
☆974Updated this week
projectnessie / nessie
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
☆1,289Updated this week
apache / hudi-rs
The native Rust implementation for Apache Hudi, with C++ & Python API bindings.
☆247Updated this week
lakekeeper / lakekeeper
Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.
☆856Updated last week
apache / uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
☆425Updated last week
facebookincubator / nimble
New file format for storage of large columnar datasets.
☆586Updated 2 weeks ago
apache / polaris
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
☆1,624Updated last week
apache / fluss
Apache Fluss is a streaming storage built for real-time analytics.
☆1,403Updated this week
lakehq / sail
LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive AI workloads.
☆886Updated this week
apache / amoro
Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.
☆1,039Updated last week
oap-project / Gluten-Trino
Gluten: Plugin to Boost Trino's Performance
☆74Updated last year
apache / kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,232Updated last week
delta-io / delta-kernel-rs
A native Delta implementation for integration with any query engine
☆246Updated last week
apache / datafusion-ray
Apache DataFusion Ray
☆217Updated 2 weeks ago
nexmark / nexmark
Benchmarks for queries over continuous data streams.
☆357Updated 8 months ago
apache / gravitino
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
☆1,765Updated last week
facebookincubator / velox
A composable and fully extensible C++ execution engine library for data management systems.
☆3,861Updated this week
apache / paimon
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch …
☆2,939Updated this week
apache / paimon-rust
Apache Paimon Rust The rust implementation of Apache Paimon.
☆129Updated 4 months ago
delta-io / kafka-delta-ingest
A highly efficient daemon for streaming data from Kafka into Delta Lake
☆411Updated 3 months ago
GlareDB / glaredb
GlareDB: A light and fast SQL database for analytics
☆961Updated last week
rlink-rs / rlink-rs
High-performance Stream Processing Framework. An alternative to Apache Flink.
☆467Updated last year
rewrite-bigdata-in-rust / RBIR
A collection of RBIR projects and posts for anyone interested in joining this journey.
☆278Updated this week
apache / iceberg-docs
Apache Iceberg Documentation Site
☆42Updated last year