kwai / blazeLinks

Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.

☆1,506

Alternatives and similar repositories for blaze

Users that are interested in blaze are comparing it to the libraries listed below

Sorting:

apache / datafusion-comet
Apache DataFusion Comet Spark Accelerator
☆1,007Updated last week
apache / incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
☆1,404Updated this week
apache / iceberg-rust
Apache Iceberg
☆1,043Updated this week
apache / datafusion-ballista
Apache DataFusion Ballista Distributed Query Engine
☆1,807Updated this week
substrait-io / substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
☆1,362Updated last month
apache / celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
☆965Updated this week
linkedin / coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
☆846Updated 2 weeks ago
lakekeeper / lakekeeper
Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.
☆818Updated this week
apache / hudi-rs
The native Rust implementation for Apache Hudi, with C++ & Python API bindings.
☆239Updated last week
apache / uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
☆421Updated this week
projectnessie / nessie
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
☆1,269Updated this week
apache / fluss
Apache Fluss is a streaming storage built for real-time analytics.
☆1,357Updated this week
apache / amoro
Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
☆1,027Updated this week
lakehq / sail
LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive AI workloads.
☆858Updated this week
oap-project / Gluten-Trino
Gluten: Plugin to Boost Trino's Performance
☆74Updated last year
facebookincubator / nimble
New file format for storage of large columnar datasets.
☆577Updated this week
apache / kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,226Updated this week
apache / polaris
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
☆1,602Updated this week
facebookincubator / velox
A composable and fully extensible C++ execution engine library for data management systems.
☆3,827Updated last week
delta-io / kafka-delta-ingest
A highly efficient daemon for streaming data from Kafka into Delta Lake
☆410Updated 2 months ago
rewrite-bigdata-in-rust / RBIR
A collection of RBIR projects and posts for anyone interested in joining this journey.
☆262Updated this week
apache / paimon
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch …
☆2,903Updated this week
apache / paimon-rust
Apache Paimon Rust The rust implementation of Apache Paimon.
☆129Updated 3 months ago
delta-io / delta-kernel-rs
A native Delta implementation for integration with any query engine
☆238Updated this week
GlareDB / glaredb
GlareDB: A light and fast SQL database for analytics
☆951Updated last week
nexmark / nexmark
Benchmarks for queries over continuous data streams.
☆354Updated 7 months ago
andygrove / how-query-engines-work
This is the companion repository for the book How Query Engines Work.
☆396Updated 2 years ago
ClickHouse / ClickBench
ClickBench: a Benchmark For Analytical Databases
☆853Updated this week
apache / datafusion-ray
Apache DataFusion Ray
☆214Updated 3 months ago
timeplus-io / proton
A high-performance SQL engine written in C++, designed for real-time data processing. It can read millions of rows per second from ClickH…
☆1,859Updated 3 weeks ago